
PCT  WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification: C12N 15/86, 7/00, 15/88,
A61K 48/00 // C07K 14/47

(11) International Publication Number: WO 96/13597 A2

(43) International Publication Date: 9 May 1996 (09.05.96)

(21) International Application Number: PCT/US95/14017

(22) International Filing Date: 27 October 1995 (27.10.95)

(30) Priority Data: 08/331,381; 28 October 1994 (28.10.94); US

(60) Parent Application or Grant
(63) Related by Continuation
US 08/331,381 (CIP), filed on 28 October 1994 (28.10.94)

(71) Applicant (for all designated States except US): THE TRUSTEES OF
THE UNIVERSITY OF PENNSYLVANIA [US/US]; 133 South 36th Street,
Philadelphia, PA 19104-3246 (US).

(72) Inventors; and
(75) Inventors/Applicants (for US only): WILSON, James, M. [US/US];
1350 N. Avignon Drive, Gladwyne, PA 19035 (US). FISHER, Krishna, J.
[US/US]; 4006 Pine Street, Philadelphia, PA 19104 (US). CHEN, Shu-Jen
[-/US]; 3901 Conshohocken Avenue, Philadelphia, PA 19131 (US).
WEITZMAN, Matthew [GB/US]; 301 S. 19th Street #2A, Philadelphia, PA
19103 (US).

(74) Agents: BAK, Mary, E. et al.; Howson and Howson, Spring House
Corporate Center, P.O. Box 457, Spring House, PA 19477 (US).

(81) Designated States: AL, AM, AU, BB, BG, BR, BY, CA, CN, CZ, EE,
FI, GE, HU, IS, JP, KE, KG, KP, KR, KZ, LK, LR, LT, LV, MD, MG, MK,
MN, MW, MX, NO, NZ, PL, RO, RU, SD, SG, SI, SK, TJ, TM, TT, UA, UG,
US, UZ, VN, European patent (AT, BE, CH, DE, DK, ES, FR, GB, GR, IE,
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN,
ML, MR, NE, SN, TD, TG), ARIPO patent (KE, LS, MW, SD, SZ, UG).

Published
Without international search report and to be republished upon
receipt of that report.



(54) Title: IMPROVED ADENOVIRUS AND METHODS OF USE THEREOF 



(57) Abstract 

A recombinant adenovirus and a method for producing the virus are
provided which utilize a recombinant shuttle vector comprising
adenovirus DNA sequence for the 5' and 3' cis-elements necessary for
replication and virion encapsidation in the absence of sequence
encoding viral genes and a selected minigene linked thereto, and a
helper adenovirus comprising sufficient adenovirus gene sequences
necessary for a productive viral infection. Desirably the helper gene
is crippled by modifications to its 5' packaging sequences, which
facilitates purification of the viral particle from the helper virus.




WO 96/13597 PCT/US95/14017

IMPROVED ADENOVIRUS AND METHODS OF USE THEREOF 

This invention was supported by the National Institute of Health
Grant No. P30 DK 47757. The United States government has rights in
this invention.

Field of the Invention 

The present invention relates to the field of vectors useful in
somatic gene therapy and the production thereof.

Background of the Invention 

Human gene therapy is an approach to treating human disease that is
based on the modification of gene expression in cells of the patient.
It has become apparent over the last decade that the single most
outstanding barrier to the success of gene therapy as a strategy for
treating inherited diseases, cancer, and other genetic dysfunctions
is the development of useful gene transfer vehicles. Eukaryotic
viruses have been employed as vehicles for somatic gene therapy.
Among the viral vectors that have been cited frequently in gene
therapy research are adenoviruses.

Adenoviruses are eukaryotic DNA viruses that can be modified to
efficiently deliver a therapeutic or reporter transgene to a variety
of cell types. Recombinant adenoviruses types 2 and 5 (Ad2 and Ad5,
respectively), which cause respiratory disease in humans, are
currently being developed for gene therapy. Both Ad2 and Ad5 belong
to a subclass of adenovirus that are not associated with human
malignancies. Recombinant adenoviruses are capable of providing
extremely high levels of transgene delivery to virtually all cell
types, regardless of the mitotic state. High titers (10^13 plaque
forming units/ml) of recombinant virus can be easily generated in 293
cells (the adenovirus equivalent to retrovirus packaging cell lines)
and cryo-stored for extended periods without appreciable losses. The
efficacy of this system in delivering a therapeutic transgene in vivo
that complements a genetic imbalance has been demonstrated in animal
models of various disorders [Y. Watanabe, Atherosclerosis, 36:261-268
(1986); K. Tanzawa et al, FEBS Letters, 118(1):81-84 (1980); J.L.
Goldstein et al, New Engl. J. Med., 309(11983):288-296 (1983); S.
Ishibashi et al, J. Clin. Invest., 92:883-893 (1993); and S.
Ishibashi et al, J. Clin. Invest., 93:1885-1893 (1994)]. Indeed, a
recombinant replication defective adenovirus encoding a cDNA for the
cystic fibrosis transmembrane regulator (CFTR) has been approved for
use in at least two human CF clinical trials [see, e.g., J. Wilson,
Nature, 365:691-692 (Oct. 21, 1993)]. Further support of the safety
of recombinant adenoviruses for gene therapy is the extensive
experience of live adenovirus vaccines in human populations.

Human adenoviruses are comprised of a linear, approximately 36 kb
double-stranded DNA genome, which is divided into 100 map units
(m.u.), each of which is 360 bp in length. The DNA contains short
inverted terminal repeats (ITR) at each end of the genome that are
required for viral DNA replication. The gene products are organized
into early (E1 through E4) and late (L1 through L5) regions, based on
expression before or after the initiation of viral DNA synthesis
[see, e.g., Horwitz, Virology, 2d edit., ed. B. N. Fields, Raven
Press, Ltd., New York (1990)].
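
The map-unit arithmetic in the paragraph above can be sketched as follows. This is only an illustration, assuming the rounded 36 kb genome length stated in the text; the function names are hypothetical and are not part of the specification.

```python
# Values stated in the text: a genome of roughly 36 kb divided into 100 map units.
GENOME_BP = 36_000   # approximate figure from the text; real serotypes differ slightly
MAP_UNITS = 100


def bp_per_map_unit(genome_bp: int = GENOME_BP) -> float:
    """Length of one map unit in base pairs."""
    return genome_bp / MAP_UNITS


def bp_to_map_units(position_bp: float, genome_bp: int = GENOME_BP) -> float:
    """Convert a base-pair position to map units (hypothetical helper)."""
    return position_bp / bp_per_map_unit(genome_bp)


if __name__ == "__main__":
    print(bp_per_map_unit())      # one map unit, in bp
    print(bp_to_map_units(360))   # position of bp 360, in map units
```

With the rounded genome length, one map unit is 360 bp, so bp 1 to about 360 corresponds to map units 0-1.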

The first-generation recombinant, replication-deficient adenoviruses
which have been developed for gene therapy contain deletions of the
entire E1a and part of the E1b regions. This replication-defective
virus is grown on an adenovirus-transformed, complementation human
embryonic kidney cell line containing a functional adenovirus E1a
gene which provides a transacting E1a protein, the 293 cell [ATCC
CRL1573]. E1-deleted viruses are capable of replicating and producing
infectious virus in the 293 cells, which provide E1a and E1b region
gene products in trans. The resulting virus is capable of infecting
many cell types and can express the introduced gene (providing it
carries its own promoter), but cannot replicate in a cell that does
not carry the E1 region DNA unless the cell is infected at a very
high multiplicity of infection.

However, in vivo studies revealed transgene expression in these E1
deleted vectors was transient and invariably associated with the
development of severe inflammation at the site of vector targeting
[S. Ishibashi et al, J. Clin. Invest., 93:1885-1893 (1994); J. M.
Wilson et al, Proc. Natl. Acad. Sci., USA, 85:4421-4424 (1988); J. M.
Wilson et al, Clin. Bio., 1:21-26 (1991); M. Grossman et al, Som.
Cell. and Mol. Gen., 17:601-607 (1991)]. One explanation that has
been proposed for this finding is that first generation recombinant
adenoviruses, despite the deletion of E1 genes, express low levels of
other viral proteins. This could be due to basal expression from the
unstimulated viral promoters or transactivation by cellular factors.
Expression of viral proteins leads to cellular immune responses to
the genetically modified cells, resulting in their destruction and
replacement with nontransgene containing cells.

There yet remains a need in the art for the development of additional
adenovirus vector constructs for gene therapy.



Summary of the Invention

In one aspect, the invention provides the components of a novel
recombinant adenovirus production system. One component is a shuttle
plasmid, pAdA, that comprises adenovirus cis-elements necessary for
replication and virion encapsidation and is deleted of all viral
genes. This vector carries a selected transgene under the control of
a selected promoter and other conventional vector/plasmid regulatory
components. The other component is a helper adenovirus, which alone
or with a packaging cell line, supplies sufficient gene sequences
necessary for a productive viral infection. In a preferred
embodiment, the helper virus has been altered to contain
modifications to the native gene sequences which direct efficient
packaging, so as to substantially disable or "cripple" the packaging
function of the helper virus or its ability to replicate.

In another aspect, the present invention provides a unique
recombinant adenovirus, an AdA virus, produced by use of the
components above. This recombinant virus comprises an adenovirus
capsid, adenovirus cis-elements necessary for replication and virion
encapsidation, but is deleted of all viral genes (i.e., all viral
open reading frames). This virus particle carries a selected
transgene under the control of a selected promoter and other
conventional vector regulatory components. This AdA recombinant virus
is characterized by high titer transgene delivery to a host cell and
the ability to stably integrate the transgene into the host cell
chromosome. In one embodiment, the virus carries as its transgene a
reporter gene. Another embodiment of the recombinant virus contains a
therapeutic transgene.

In another aspect, the invention provides a method for producing the
above-described recombinant AdA virus by co-transfecting a cell line
(either a packaging cell line or a non-packaging cell line) with a
shuttle vector or plasmid and a helper adenovirus as described above,
wherein the transfected cell generates the AdA virus. The AdA virus
is subsequently isolated and purified therefrom.

In yet a further aspect, the invention provides a method for
delivering a selected gene to a host cell for expression in that cell
by administering an effective amount of a recombinant AdA virus
containing a therapeutic transgene to a patient to treat or correct a
genetically associated disorder or disease.

Other aspects and advantages of the present invention are described
further in the following detailed description of the preferred
embodiments thereof.

Brief Description of the Figures 

Fig. 1A is a schematic representation of the organization of the
major functional elements that define the 5' terminus from Ad5
including an inverted terminal repeat (ITR) and a packaging/enhancer
domain. The TATA box of the E1 promoter (black box) and E1A
transcriptional start site (arrow) are also shown.

Fig. 1B is an expanded schematic of the packaging/enhancer region of
Fig. 1A, indicating the five packaging (PAC) domains (A-repeats), I
through V. The arrows indicate the location of PCR primers referenced
in Figs. 9A and 9B below.

Fig. 2A is a schematic of shuttle vector pAdA.CMVLacZ containing 5'
ITR from Ad5, followed by a CMV promoter/enhancer, a LacZ gene, a 3'
ITR from Ad5, and remaining plasmid sequence from plasmid pSP72
backbone. Restriction endonuclease enzymes are represented by
conventional designations in the plasmid constructs.

Fig. 2B is a schematic of the shuttle vector digested with EcoRI to
release the modified AdA genome from the pSP72 plasmid backbone.

Fig. 2C is a schematic depiction of the function of the vector
system. In the presence of an E1-deleted helper virus Ad.CBhpAP which
encodes a reporter minigene for human placenta alkaline phosphatase
(hpAP), the AdA.CMVLacZ genome is packaged into preformed virion
capsids, distinguishable from the helper virions by the presence of
the LacZ gene.

Figs. 3A to 3F [SEQ ID NO: 1] report the top DNA strand of the
double-stranded plasmid pAdA.CMVLacZ. The complementary sequence may
be readily obtained by one of skill in the art. The sequence includes
the following components: 3' Ad ITR (nucleotides 607-28 of SEQ ID NO:
1); the 5' Ad ITR (nucleotides 5496-5144 of SEQ ID NO: 1); CMV
promoter/enhancer (nucleotides 5117-4524 of SEQ ID NO: 1); SD/SA
sequence (nucleotides 4507-4376 of SEQ ID NO: 1); LacZ gene
(nucleotides 4320-845 of SEQ ID NO: 1); and a poly A sequence
(nucleotides 837-639 of SEQ ID NO: 1).
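
The component coordinates listed for SEQ ID NO: 1 can be tabulated and checked with a short script. This is a minimal sketch: the `Feature` class and helper names are hypothetical, and treating each coordinate pair as an inclusive range (so that length is the difference plus one) is an assumption rather than something the text states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    name: str
    start: int  # first coordinate as listed in the text
    end: int    # second coordinate as listed in the text

    @property
    def length(self) -> int:
        # Assumes an inclusive coordinate range, regardless of listing order.
        return abs(self.start - self.end) + 1


# Coordinates copied from the description of Figs. 3A to 3F.
SEQ1_FEATURES = [
    Feature("3' Ad ITR", 607, 28),
    Feature("5' Ad ITR", 5496, 5144),
    Feature("CMV promoter/enhancer", 5117, 4524),
    Feature("SD/SA sequence", 4507, 4376),
    Feature("LacZ gene", 4320, 845),
    Feature("poly A sequence", 837, 639),
]


def feature_lengths(features):
    """Map each feature name to its length in nucleotides."""
    return {f.name: f.length for f in features}


def by_position(features):
    """Order features by their lowest coordinate on the plasmid."""
    return sorted(features, key=lambda f: min(f.start, f.end))


def neighbour_overlaps(features):
    """Return name pairs of positionally adjacent features whose ranges overlap."""
    ordered = by_position(features)
    clashes = []
    for left, right in zip(ordered, ordered[1:]):
        if max(left.start, left.end) >= min(right.start, right.end):
            clashes.append((left.name, right.name))
    return clashes


if __name__ == "__main__":
    for f in by_position(SEQ1_FEATURES):
        print(f"{f.name}: {f.start}-{f.end} ({f.length} nt)")
```

Ordered by position, the listed components run 3' ITR, poly A, LacZ, SD/SA, CMV promoter/enhancer, 5' ITR, with no overlap between neighbours.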

Fig. 4A is a schematic of shuttle vector pAdAc.CMVLacZ containing an
Ad5 5' ITR and 3' ITR positioned head-to-tail, with a CMV
enhancer/promoter-LacZ minigene immediately following the 5' ITR,
followed by a plasmid pSP72 (Promega) backbone. Restriction
endonuclease enzymes are represented by conventional designations in
the plasmid constructs.

Fig. 4B is a schematic depiction of the function of the vector system
of Fig. 4A. In the presence of helper virus Ad.CBhpAP, the circular
pAdAc.CMVLacZ shuttle vector sequence is packaged into virion heads,
distinguishable from the helper virions by the presence of the LacZ
gene.

Figs. 5A to 5F [SEQ ID NO: 2] report the top DNA strand of the
double-stranded vector pAdAc.CMVLacZ. The complementary sequence may
be readily obtained by one of skill in the art. The sequence includes
the following components: 5' Ad ITR (nucleotides 600-958 of SEQ ID
NO: 2); CMV promoter/enhancer (nucleotides 969-1563 of SEQ ID NO: 2);
SD/SA sequence (nucleotides 1579-1711 of SEQ ID NO: 2); LacZ gene
(nucleotides 1762-5236 of SEQ ID NO: 2); poly A sequence (nucleotides
5245-5443 of SEQ ID NO: 2); and 3' Ad ITR (nucleotides 16-596 of SEQ
ID NO: 2).

Fig. 6 is a schematic of shuttle vector pAdA.CBCFTR containing 5' ITR
from Ad5, followed by a chimeric CMV enhancer/β-actin promoter, a
CFTR gene, a poly-A sequence, a 3' ITR from Ad5, and remaining
plasmid sequence from plasmid pSL1180 (Pharmacia) backbone.
Restriction endonuclease enzymes are represented by conventional
designations in the plasmid constructs.

Figs. 7A to 7H [SEQ ID NO: 3] report the top DNA strand of the
double-stranded plasmid pAdA.CBCFTR. The complementary sequence may
be readily obtained by one of skill in the art. The sequence includes
the following components: 5' Ad ITR (nucleotides 9611-9254 of SEQ ID
NO: 3); chimeric CMV enhancer/β-actin promoter (nucleotides 9241-8684
of SEQ ID NO: 3); CFTR gene (nucleotides 8622-4065 of SEQ ID NO: 3);
poly A sequence (nucleotides 3887-3684 of SEQ ID NO: 3); and 3' Ad
ITR (nucleotides 3652-3073 of SEQ ID NO: 3). The remaining plasmid
backbone is obtained from pSL1180 (Pharmacia).

Fig. 8A illustrates the generation of 5' adenovirus terminal sequence
that contained PAC domains I and II by PCR. See arrows indicating
righthand and lefthand (PAC II) PCR probes in Fig. 1B.

Fig. 8B illustrates the generation of 5' terminal sequence that
contained PAC domains I, II, III and IV by PCR. See arrows indicating
righthand and lefthand (PAC IV) PCR probes in Fig. 1B.
Fig. 8C depicts the amplification products subcloned into the
multiple cloning site of pAd.Link.1 (IHGT Vector core) generating
pAd.PACII (domains I and II) and pAd.PACIV (domains I, II, III, and
IV) resulting in crippled helper viruses, Ad.PACII and Ad.PACIV with
modified packaging (PAC) signals.

Fig. 9A is a schematic representation of the subcloning of a human
placenta alkaline phosphatase reporter minigene containing the
immediate early CMV enhancer/promoter (CMV), human placenta alkaline
phosphatase cDNA (hpAP), and SV40 polyadenylation signal (pA) into
pAd.PACII to result in crippled helper virus vector
pAdA.PACII.CMVhpAP. Restriction endonuclease enzymes are represented
by conventional designations in the plasmid constructs.
Fig. 9B is a schematic representation of the subcloning of the same
minigene of Fig. 9A into pAd.PACIV to result in crippled helper virus
vector pAd.PACIV.CMV.hpAP.

Fig. 10 is a flow diagram summarizing the synthesis of an
adenovirus-based polycation helper virus conjugate and its
combination with a pAdA shuttle vector to result in a novel viral
particle complex. CsCl band purified helper adenovirus was reacted
with the heterobifunctional crosslinker sulfo-SMCC and the capsid
protein fiber is labeled with the nucleophilic maleimide moiety. Free
sulfhydryls were introduced onto poly-L-lysine using
2-iminothiolane-HCl and mixed with the labelled adenovirus, resulting
in the helper virus conjugate Ad-pLys. A unique adenovirus-based
particle is generated by purifying the Ad-pLys conjugate over a CsCl
gradient to remove unincorporated poly-L-lysine, followed by
extensively dialyzing, adding shuttle plasmid DNAs to Ad-pLys and
allowing the complex formed by the shuttle plasmid wrapped around
Ad-pLys to develop.
Fig. 11 is a schematic diagram of pCCL-DMD, which is described in
detail in Example 9 below.

Figs. 12A - 12P provide the continuous DNA sequence of pAdA.CMVmDys
[SEQ ID NO: 10].

Detailed Description of the Invention

The present invention provides a unique recombinant adenovirus
capable of delivering transgenes to target cells, as well as the
components for production of the unique virus and methods for the use
of the virus to treat a variety of genetic disorders.

The AdA virus of this invention is a viral particle containing only
the adenovirus cis-elements necessary for replication and virion
encapsidation (i.e., ITRs and packaging sequences), but otherwise
deleted of all adenovirus genes (i.e., all viral open reading
frames). This virus carries a selected transgene under the control of
a selected promoter and other conventional regulatory components,
such as a poly A signal. The AdA virus is characterized by improved
persistence of the vector DNA in the host cells, reduced
antigenicity/immunogenicity, and hence, improved performance as a
delivery vehicle. An additional advantage of this invention is that
the AdA virus permits the packaging of very large transgenes, such as
a full-length dystrophin cDNA for the treatment of the progressive
wasting of muscle tissue characteristic of Duchenne Muscular
Dystrophy (DMD).

This novel recombinant virus is produced by use of an
adenovirus-based vector production system containing two components:
1) a shuttle vector that comprises adenovirus cis-elements necessary
for replication and virion encapsidation and is deleted of all viral
genes, which vector carries a reporter or therapeutic minigene and 2)
a helper adenovirus which, alone or with a packaging cell line, is
capable of providing all of the viral gene products necessary for a
productive viral infection when co-transfected with the shuttle
vector. Preferably, the helper virus is modified so that it does not
package itself efficiently. In this setting, it is desirably used in
combination with a packaging cell line that stably expresses
adenovirus genes. The methods of producing this viral vector from
these components include both a novel means of packaging of an
adenoviral/transgene containing vector into a virus, and a novel
method for the subsequent separation of the helper virus from the
newly formed recombinant virus.

I. The Shuttle Vector

The shuttle vector, referred to as pAdA, is composed of adenovirus
sequences, and transgene sequences, including vector regulatory
control sequences.

A. The Adenovirus Sequences

The adenovirus nucleic acid sequences of the shuttle vector provide
the minimum adenovirus sequences which enable a viral particle to be
produced with the assistance of a helper virus. These sequences
assist in delivery of a recombinant transgene genome to a target cell
by the resulting recombinant virus.

The DNA sequences of a number of adenovirus types are available from
Genbank, including type Ad5 [Genbank Accession No. M73260]. The
adenovirus sequences may be obtained from any known adenovirus
serotype, such as serotypes 2, 3, 4, 7, 12 and 40, and further
including any of the presently identified 41 human types [see, e.g.,
Horwitz, cited above]. Similarly adenoviruses known to infect other
animals may also be employed in the vector constructs of this
invention. The selection of the adenovirus type is not anticipated to
limit the following invention. A variety of adenovirus strains are
available from the American Type Culture Collection, Rockville,
Maryland, or available by request from a variety of commercial and
institutional sources. In the following exemplary embodiment an
adenovirus, type 5 (Ad5) is used for convenience.

However, it is desirable to obtain a variety of pAdA shuttle vectors
based on different human adenovirus serotypes. It is anticipated that
a library of such plasmids and the resulting AdA viral vectors would
be useful in a therapeutic regimen to evade cellular, and possibly
humoral, immunity, and lengthen the duration of transgene expression,
as well as improve the success of repeat therapeutic treatments.
Additionally the use of various serotypes is believed to produce
recombinant viruses with different tissue targeting specificities.
The absence of adenoviral genes in the AdA viral vector is
anticipated to reduce or eliminate adverse CTL response which
normally causes destruction of recombinant adenoviruses deleted of
only the E1 gene.

Specifically, the adenovirus nucleic acid sequences employed in the
pAdA shuttle vector of this invention are adenovirus genomic
sequences from which all viral genes are deleted. More specifically,
the adenovirus sequences employed are the cis-acting 5' and 3'
inverted terminal repeat (ITR) sequences of an adenovirus (which
function as origins of replication) and the native 5'
packaging/enhancer domain, that contains sequences necessary for
packaging linear Ad genomes and enhancer elements for the E1
promoter. These sequences are the sequences necessary for replication
and virion encapsidation. See, e.g., P. Hearing et al, J. Virol.,
61(8):2555-2558 (1987); M. Grable and P. Hearing, J. Virol.,
64(5):2047-2056 (1990); and M. Grable and P. Hearing, J. Virol.,
66(2):723-731 (1992).

According to this invention, the entire adenovirus 5' sequence
containing the 5' ITR and packaging/enhancer region can be employed
as the 5' adenovirus sequence in the pAdA shuttle vector. This left
terminal (5') sequence of the Ad5 genome useful in this invention
spans bp 1 to about 360 of the conventional adenovirus genome, also
referred to as map units 0-1 of the viral genome. This sequence is
provided herein as nucleotides 5496-5144 of SEQ ID NO: 1, nucleotides
600-958 of SEQ ID NO: 2; and nucleotides 9611-9254 of SEQ ID NO: 3,
and generally is from about 353 to about 360 nucleotides in length.
This sequence includes the 5' ITR (bp 1-103 of the adenovirus
genome), and the packaging/enhancer domain (bp 194-358 of the
adenovirus genome). See, Figs. 1A, 3, 5, and 7.
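
As a quick consistency check on the stated length range, the three coordinate pairs given for the 5' sequence can be compared against "about 353 to about 360 nucleotides". The sketch below assumes inclusive coordinates; the names are hypothetical.

```python
# Coordinate pairs for the 5' adenovirus sequence, copied from the text.
FIVE_PRIME_RANGES = {
    "SEQ ID NO: 1": (5496, 5144),
    "SEQ ID NO: 2": (600, 958),
    "SEQ ID NO: 3": (9611, 9254),
}

# Length range stated in the text ("from about 353 to about 360 nucleotides").
STATED_MIN, STATED_MAX = 353, 360


def span_length(start: int, end: int) -> int:
    """Inclusive length of a coordinate pair, whichever order it is listed in."""
    return abs(start - end) + 1


def five_prime_lengths() -> dict:
    """Length of the 5' sequence as listed for each SEQ ID."""
    return {seq: span_length(*pair) for seq, pair in FIVE_PRIME_RANGES.items()}


if __name__ == "__main__":
    for seq, length in five_prime_lengths().items():
        within = STATED_MIN <= length <= STATED_MAX
        print(f"{seq}: {length} nt, within stated range: {within}")
```

Under that assumption the three listings give 353, 359 and 358 nucleotides, all inside the stated range.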

Preferably, this native adenovirus 5' region is employed in the
shuttle vector in unmodified form. However, some modifications
including deletions, substitutions and additions to this sequence
which do not adversely affect its biological function may be
acceptable. See, e.g., WO 93/24641, published December 9, 1993. The
ability to modify these ITR sequences is within the ability of one of
skill in the art. See, e.g., texts such as Sambrook et al, "Molecular
Cloning. A Laboratory Manual.", 2d edit., Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York (1989).

The 3' adenovirus sequences of the shuttle vector include the right
terminal (3') ITR sequence of the adenoviral genome spanning about bp
35,353 - end of the adenovirus genome, or map units ~98.4-100. This
sequence is provided herein as nucleotides 607-28 of SEQ ID NO: 1,
nucleotides 16-596 of SEQ ID NO: 2; and nucleotides 3652-3073 of SEQ
ID NO: 3, and generally is about 580 nucleotides in length. This
entire sequence is desirably employed as the 3' sequence of a pAdA
shuttle vector. Preferably, the native adenovirus 3' region is
employed in the shuttle vector in unmodified form. However, some
modifications to this sequence which do not adversely affect its
biological function may be acceptable.
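
The figures quoted for the 3' sequence can be cross-checked in the same way. In the sketch below, the genome end position is not taken from the text: it is inferred from the stated start (about bp 35,353) and the stated length (about 580 nucleotides), so it should be read as an assumption. The helper names are hypothetical.

```python
# Coordinate pairs for the 3' adenovirus sequence, copied from the text.
THREE_PRIME_RANGES = {
    "SEQ ID NO: 1": (607, 28),
    "SEQ ID NO: 2": (16, 596),
    "SEQ ID NO: 3": (3652, 3073),
}

START_BP = 35_353        # "about bp 35,353" in the text
APPROX_LENGTH = 580      # "about 580 nucleotides" in the text


def span_length(start: int, end: int) -> int:
    """Inclusive length of a coordinate pair, whichever order it is listed in."""
    return abs(start - end) + 1


def three_prime_lengths() -> list:
    """Lengths of the 3' sequence as listed for SEQ ID NOS: 1, 2 and 3."""
    return [span_length(*pair) for pair in THREE_PRIME_RANGES.values()]


def implied_genome_end() -> int:
    """Genome end implied by the stated start and length (an inference)."""
    return START_BP + APPROX_LENGTH - 1


def start_in_map_units() -> float:
    """Start of the 3' sequence as a percentage of the implied genome length."""
    return 100 * START_BP / implied_genome_end()


if __name__ == "__main__":
    print(three_prime_lengths())
    print(implied_genome_end(), round(start_in_map_units(), 1))
```

On these assumptions the start position works out to roughly map unit 98.4, which agrees with the range given in the text.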

An exemplary pAdA shuttle vector of this invention, described below
and in Fig. 2A, contains only those adenovirus sequences required for
packaging adenoviral genomic DNA into a preformed capsid head. The
pAdA vector contains Ad5 sequences encoding the 5' terminal and 3'
terminal sequences (identified in the description of Fig. 3), as well
as the transgene sequences described below.

From the foregoing information, it is expected that one of skill in
the art may employ other equivalent adenovirus sequences for use in
the AdA vectors of this invention. These sequences may include other
adenovirus strains, or the above mentioned cis-acting sequences with
minor modifications.

B. The Transgene

The transgene sequence of the vector and recombinant virus is a
nucleic acid sequence or reverse transcript thereof, heterologous to
the adenovirus sequence, which encodes a polypeptide or protein of
interest. The transgene is operatively linked to regulatory
components in a manner which permits transgene transcription.

The composition of the transgene sequence will depend upon the use to
which the resulting virus will be put. For example, one type of
transgene sequence includes a reporter sequence, which upon
expression produces a detectable signal. Such reporter sequences
include without limitation an E. coli beta-galactosidase (LacZ) cDNA,
a human placental alkaline phosphatase gene and a green fluorescent
protein gene. These sequences, when associated with regulatory
elements which drive their expression, provide signals detectable by
conventional means, e.g., ultraviolet wavelength absorbance, visible
color change, etc.

Another type of transgene sequence includes a therapeutic gene which
expresses a desired gene product in a host cell. These therapeutic
nucleic acid sequences typically encode products for administration
and expression in a patient in vivo or ex vivo to replace or correct
an inherited or non-inherited genetic defect or treat an epigenetic
disorder or disease. Such therapeutic genes which are desirable for
the performance of gene therapy include, without limitation, a normal
cystic fibrosis transmembrane regulator (CFTR) gene (see Fig. 7), a
low density lipoprotein (LDL) gene [T. Yamamoto et al, Cell, 39:27-38
(November, 1984)], a DMD cDNA sequence [partial sequences available
from GenBank, Accession Nos. M36673, M36671, [A. P. Monaco et al,
Nature, 323:646-650 (1986)] and L06900, [Roberts et al, Hum. Mutat.,
2:293-299 (1993)]] (Genbank), and a number of genes which may be
readily selected by one of skill in the art. The selection of the
transgene is not considered to be a limitation of this invention, as
such selection is within the knowledge of the art-skilled.
C. Regulatory Elements

In addition to the major elements identified above for the pAdA
shuttle vector, i.e., the adenovirus sequences and the transgene, the
vector also includes conventional regulatory elements necessary to
drive expression of the transgene in a cell transfected with the pAdA
vector. Thus the vector contains a selected promoter which is linked
to the transgene and located, with the transgene, between the
adenovirus sequences of the vector.

Selection of the promoter is a routine matter and is not a limitation
of the pAdA vector itself. Useful promoters may be constitutive
promoters or regulated (inducible) promoters, which will enable
control of the amount of the transgene to be expressed. For example,
a desirable promoter is the cytomegalovirus immediate early
promoter/enhancer [see, e.g., Boshart et al, Cell, 41:521-530
(1985)]. This promoter is found at nucleotides 5117-4524 of SEQ ID
NO: 1 and nucleotides 969-1563 of SEQ ID NO: 2. Another promoter is
the CMV enhancer/chicken β-actin promoter (nucleotides 9241-8684 of
SEQ ID NO: 3). Another desirable promoter includes, without
limitation, the Rous sarcoma virus LTR promoter/enhancer. Still other
promoter/enhancer sequences may be selected by one of skill in the
art.

The shuttle vectors will also desirably contain
nucleic acid sequences heterologous to the adenovirus
sequences, including sequences providing signals required
for efficient polyadenylation of the transcript and
introns with functional splice donor and acceptor sites
(SD/SA). A common poly-A sequence which is employed in
the exemplary vectors of this invention is that derived
from the papovavirus SV-40 [see, e.g., nucleotides 837-639
of SEQ ID NO: 1; 5245-5443 of SEQ ID NO: 2; and 3887-3684
of SEQ ID NO: 3]. The poly-A sequence generally is
inserted in the vector following the transgene sequences
and before the 3' adenovirus sequences. A common intron
sequence is also derived from SV-40, and is referred to
as the SV-40 T intron sequence [see, e.g., nucleotides
4507-4376 of SEQ ID NO: 1 and 1579-1711 of SEQ ID NO: 2].
A pAdA shuttle vector of the present invention may also
contain such an intron, desirably located between the
promoter/enhancer sequence and the transgene. Selection
of these and other common vector elements is
conventional and many such sequences are available [see,
e.g., Sambrook et al, and references cited therein].
Examples of such regulatory sequences for the above are
provided in the plasmid sequences of Figs. 3, 5 and 7.
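
The nucleotide ranges quoted above for the promoter, poly-A and intron elements can be converted to element lengths by simple arithmetic. The following is a minimal Python sketch, not part of the disclosure; it assumes the coordinates are inclusive and that a range written high-to-low lies on the complementary strand. The names `FEATURES`, `span_length` and `strand` are illustrative only.

```python
# Coordinates are copied from the text; treating them as inclusive,
# and reading high-to-low ranges as complementary-strand features,
# are assumptions made for this illustration.
FEATURES = {
    "CMV promoter/enhancer": {
        "SEQ ID NO: 1": (5117, 4524),
        "SEQ ID NO: 2": (969, 1563),
    },
    "SV-40 poly-A": {
        "SEQ ID NO: 1": (837, 639),
        "SEQ ID NO: 2": (5245, 5443),
        "SEQ ID NO: 3": (3887, 3684),
    },
    "SV-40 T intron": {
        "SEQ ID NO: 1": (4507, 4376),
        "SEQ ID NO: 2": (1579, 1711),
    },
}


def span_length(start, end):
    """Length in nucleotides of an inclusive coordinate range."""
    return abs(end - start) + 1


def strand(start, end):
    """'+' for ascending coordinates, '-' for descending ones."""
    return "+" if end >= start else "-"


for name, locations in FEATURES.items():
    for seq_id, (start, end) in locations.items():
        print(f"{name:22s} {seq_id}: "
              f"{span_length(start, end):4d} nt, strand {strand(start, end)}")
```

Lengths computed this way differ by a few nucleotides between plasmids for the same element, which reflects the quoted coordinates rather than any claim about the elements themselves.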

The combination of the transgene, promoter/enhancer,
and the other regulatory vector elements is
referred to as a "minigene" for ease of reference herein.
The minigene is preferably flanked by the 5' and 3' cis-acting
adenovirus sequences described above. Such a
minigene may have a size in the range of several hundred
base pairs up to about 30 kb due to the absence of
adenovirus early and late gene sequences in the vector.
Thus, this AdA vector system permits a great deal of
latitude in the selection of the various components of
the minigene, particularly the selected transgene, with
regard to size. Provided with the teachings of this
invention, the design of such a minigene can be made by
resort to conventional techniques.



II. The Helper Virus

Because of the limited amount of adenovirus sequence
present in the AdA shuttle vector, a helper adenovirus of
this invention must, alone or in concert with a packaging
cell line, provide sufficient adenovirus gene sequences
necessary for a productive viral infection. Helper
viruses useful in this invention thus contain selected
adenovirus gene sequences, and optionally a second
reporter minigene.

Normally, the production of a recombinant adenovirus
which utilizes helper adenovirus containing a full
complement of adenoviral genes results in recombinant
virus contaminated by excess production of the helper
virus. Thus, extensive purification of the viral vector
from the contaminating helper virus is required.
However, the present invention provides a way to
facilitate purification and reduce contamination by
crippling the helper virus.
One preferred embodiment of a helper virus of this
invention thus contains three components: (A)
modifications or deletions of the native adenoviral gene
sequences which direct efficient packaging, so as to
substantially disable or "cripple" the packaging function
of the helper virus or its ability to replicate, (B)
selected adenovirus genes and (C) an optional reporter
minigene. These "crippled" helper viruses may also be
formed into poly-cation conjugates as described below.
The adenovirus sequences forming the helper virus
may be obtained from the sources identified above in the
discussion of the shuttle vector. Use of different Ad
serotypes as helper viruses enables production of
recombinant viruses containing the AAd (serotype 5)
shuttle vector sequences in a capsid formed by the other
serotype adenovirus. These recombinant viruses are
desirable in targeting different tissues, or evading an
immune response to the AAd sequences having a serotype 5
capsid. Use of these different Ad serotype helper
viruses may also demonstrate advantages in recombinant
virus production, stability and better packaging.

A. The Crippling Modifications

A desirable helper virus used in the production
of the adenovirus vector of this invention is modified
(or crippled) in its 5' ITR packaging/enhancer domain,
identified above. As stated above, the
packaging/enhancer region contains sequences necessary
for packaging linear adenovirus genomes ("PAC"
sequences). More specifically, this sequence contains at
least seven distinct yet functionally redundant domains
that are required for efficient encapsidation of
replicated viral DNA.

Within a stretch of nucleotide sequence from bp
194-358 of the Ad5 genome, five of these so-called
A-repeats or PAC sequences are localized (see Fig. 1B).
PAC I is located at bp 241-248 of the adenovirus genome
(on the strand complementary to nucleotides 5259-5246 of
SEQ ID NO: 1). PAC II is located at bp 262-269 of the
adenovirus genome (on the strand complementary to
nucleotides 5238-5225 of SEQ ID NO: 1). PAC III is
located at bp 304-311 of the adenovirus genome (on the
strand complementary to nucleotides 5196-5183 of SEQ ID
NO: 1). PAC IV is located at bp 314-321 of the
adenovirus genome (on the strand complementary to nucleotides
5186-5172 of SEQ ID NO: 1). PAC V is located at bp 339-346
of the adenovirus genome (on the strand complementary to
nucleotides 5171-5147 of SEQ ID NO: 1).

Corresponding sequences can be obtained from
SEQ ID NO: 2 and 3. PAC I is located at nucleotides 837-851
of SEQ ID NO: 2; and on the strand complementary to
nucleotides 9374-9360 of SEQ ID NO: 3. PAC II is located
at nucleotides 859-863 of SEQ ID NO: 2; and on the strand
complementary to nucleotides 9353-9340 of SEQ ID NO: 3.
PAC III is located at nucleotides 901-916 of SEQ ID NO:
2; and on the strand complementary to nucleotides 9311-9298
of SEQ ID NO: 3. PAC IV is located at nucleotides
911-924 of SEQ ID NO: 2; and on the strand complementary
to nucleotides 9301-9288 of SEQ ID NO: 3. PAC V is
located at nucleotides 936-949 of SEQ ID NO: 2; and on
the strand complementary to nucleotides 9276-9263 of SEQ
ID NO: 3.

Table 1 below lists these five native Ad5
sequences and a consensus PAC sequence based on the
similarities between an eight-nucleotide stretch within
the five sequences. The consensus sequence contains two
positions at which the nucleotide may be A or T (A/T).
The conventional single letter designations are used for
the nucleic acids, as is known to the art.

Table 1

              Adenovirus Genome
              Base Pair Nos. &
A-Repeat      Nucleotide Sequence

                  241    248
I             TAG TAAATTTG GGC      [SEQ ID NO: 4]

                  262    269
II            AGT AAGATTTG GCC      [SEQ ID NO: 5]

                  304    311
III           AGT GAAATCTG AAT      [SEQ ID NO: 6]

                  314    321
IV            GAA TAATTTTG TGT      [SEQ ID NO: 7]

                  339    346
V             CGT AATATTTG TCT      [SEQ ID NO: 8]

Consensus     5' (A/T)AN(A/T)TTTG 3'  [SEQ ID NO: 9]
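
The relationship between the five listed repeats and the consensus of Table 1 can be checked position by position. The sketch below is illustrative only and assumes the eight-nucleotide cores as transcribed from the table (with the scanned digit "6" read as G); `PAC_CORES`, `ALLOWED` and `mismatch_count` are hypothetical names, not terms of the specification.

```python
# Eight-nucleotide cores of the five A-repeats as listed in Table 1.
# Reading the scanned "6" characters as G is an assumption.
PAC_CORES = {
    "I": "TAAATTTG",
    "II": "AAGATTTG",
    "III": "GAAATCTG",
    "IV": "TAATTTTG",
    "V": "AATATTTG",
}

# Consensus 5' (A/T)AN(A/T)TTTG 3', one set of allowed bases per position.
ALLOWED = ["AT", "A", "ACGT", "AT", "T", "T", "T", "G"]


def mismatch_count(core):
    """Number of positions at which `core` departs from the consensus."""
    if len(core) != len(ALLOWED):
        raise ValueError("core must be eight nucleotides long")
    return sum(base not in allowed for base, allowed in zip(core, ALLOWED))


for name, core in PAC_CORES.items():
    print(f"PAC {name:3s} {core}  mismatches: {mismatch_count(core)}")
```

As transcribed, repeats I, II, IV and V fit the consensus exactly, while repeat III differs at two positions, which is consistent with the consensus being described as based on similarities rather than identity.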

According to this invention, mutations or
deletions may be made to one or more of these PAC
sequences to generate desirable crippled helper viruses.
A deletion analysis of the packaging domain revealed a
positive correlation between encapsidation efficiency and
the number of packaging A-repeats that were present at
the 5' end of the genome. Modifications of this domain
may include 5' adenovirus sequences which contain less
than all five of the PAC sequences of Table 1. For
example, only two PAC sequences may be present in the
crippled virus, e.g., PAC I and PAC II, PAC III and PAC
IV, and so on. Deletions of selected PAC sequences may
involve deletion of contiguous or non-contiguous
sequences. For example, PAC II and PAC IV may be
deleted, leaving PAC I, III and V in the 5' sequence.
Still an alternative modification may be the replacement
of one or more of the native PAC sequences with one or
more repeats of the consensus sequence of Table 1.
Alternatively, this adenovirus region may be modified by
deliberately inserted mutations which disrupt one or more
of the native PAC sequences. One of skill in the art may
further manipulate the PAC sequences to similarly achieve
the effect of reducing the helper virus packaging
efficiency to a desired level.

Exemplary helper viruses which involve the
manipulation of the PAC sequences described above are
disclosed in Example 7 below. Briefly, as described in
that example, one helper virus contains in place of the
native 5' ITR region (adenovirus genome bp 1-360), a 5'
adenovirus sequence spanning adenovirus genome bp 1-269,
which contains only the 5' ITR and PAC I and PAC II
sequences, and deletes the adenovirus region bp 270-360.

Another PAC sequence modified helper virus
contains only the 5' Ad5 sequence of the ITR and PAC I
through PAC IV (Ad bp 1-321), deleting PAC V and other
sequences in the Ad region bp 322-360.
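
Which PAC elements survive a given 5' truncation follows directly from the base pair positions in Table 1. The following is a minimal sketch under that reading; the function name `retained_pacs` and the idea of describing a helper virus by the last retained base pair are illustrative assumptions.

```python
# Ad5 genome base pair positions of the five PAC elements (Table 1).
PAC_COORDS = {
    "I": (241, 248),
    "II": (262, 269),
    "III": (304, 311),
    "IV": (314, 321),
    "V": (339, 346),
}


def retained_pacs(last_bp_kept):
    """PAC elements lying entirely within bp 1..last_bp_kept."""
    return [name for name, (_, end) in PAC_COORDS.items()
            if end <= last_bp_kept]


# The two exemplary crippled helpers and the native 5' region.
for last_bp in (269, 321, 360):
    print(f"bp 1-{last_bp}: PAC {', '.join(retained_pacs(last_bp))}")
```

For the two exemplary constructs this reproduces the description in the text: bp 1-269 retains PAC I and II, and bp 1-321 retains PAC I through IV.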
These modified helper viruses are characterized
by reduced efficiency of helper virus encapsidation.
These helper viruses, with the specific modifications of
the sequences related to packaging efficiency, provide a
packaging efficiency high enough for generating
production lots of the helper virus, yet low enough that
they permit the achievement of higher yields of AdA
transducing viral particles according to this invention.

B. The Selected Adenovirus Genes

Helper viruses useful in this invention,
whether or not they contain the "crippling" modifications
described above, contain selected adenovirus gene
sequences depending upon the cell line which is
transfected by the helper virus and shuttle vector. A
preferred helper virus contains a variety of adenovirus
genes in addition to the modified sequences described
above.

As one example, if the cell line employed to
produce the recombinant virus is not a packaging cell
line, the helper virus may be a wild type Ad virus.
Thus, the helper virus supplies the necessary adenovirus
early genes E1, E2, E4 and all remaining late,
intermediate, structural and non-structural genes of the
adenovirus genome. This helper virus may be a crippled
helper virus by incorporating modifications in its native
5' packaging/enhancer domain.

A desirable helper virus is replication
defective and lacks all or a sufficient portion of the
adenoviral immediate early gene E1a (which spans mu
1.3 to 4.5) and delayed early gene E1b (which spans mu
4.6 to 11.2) so as to eliminate their normal biological
functions. Such replication deficient viruses may also
have crippling modifications in the packaging/enhancer
domain. Because of the difficulty surrounding the
absolute removal of adenovirus from AdA preparations that
have been enriched by CsCl buoyant density
centrifugation, the use of a replication defective
adenovirus helper prevents the introduction of infectious
adenovirus for in vivo animal studies. This helper virus
is employed with a packaging cell line which supplies the
deficient E1 proteins, such as the 293 cell line.

Additionally, all or a portion of the
adenovirus delayed early gene E3 (which spans mu 76.6 to
86.2) may be eliminated from the adenovirus sequence
which forms a part of the helper viruses useful in this
invention, without adversely affecting the function of
the helper virus because this gene product is not
necessary for the formation of a functioning virus.

In the presence of other packaging cell lines
which are capable of supplying adenoviral proteins in
addition to the E1, the helper virus may accordingly be
deleted of the genes encoding these adenoviral proteins.
Such additionally deleted helper viruses also desirably
contain crippling modifications as described above.
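
The division of labor between helper virus and packaging cell line described above amounts to a set difference: the helper must carry whatever essential functions the cell line does not supply. The sketch below is an illustration only; the coarse gene labels and the name `helper_must_supply` are assumptions, and E3 is omitted because the text describes it as dispensable.

```python
# Coarse labels for the functions named in the text; "late/structural"
# stands in for all remaining late, intermediate, structural and
# non-structural genes. These labels are illustrative assumptions.
ESSENTIAL = frozenset({"E1", "E2", "E4", "late/structural"})


def helper_must_supply(cell_line_supplies=frozenset()):
    """Essential functions left for the helper virus to provide."""
    return set(ESSENTIAL) - set(cell_line_supplies)


# Non-packaging cell line: the helper supplies everything.
print(sorted(helper_must_supply()))

# 293 cells supply E1, so an E1-deleted helper suffices.
print(sorted(helper_must_supply({"E1"})))
```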
C. A Reporter Minigene
It is also desirable for the helper virus to
contain a reporter minigene, in which the reporter gene
is desirably different from the reporter transgene
contained in the shuttle vector. A number of such
reporter genes are known, as referred to above. The
presence of a reporter gene on the helper virus which is
different from the reporter gene on the pAdA allows both
the recombinant AdA virus and the helper virus to be
independently monitored. For example, the expression of
recombinant alkaline phosphatase enables residual
quantities of contaminating adenovirus to be monitored
independent of recombinant LacZ expressed by a pAdA
shuttle vector or an AdA virus.

D. Helper Virus Polycation Conjugates
Still another method for reducing the
contamination of helper virus involves the formation of
poly-cation helper virus conjugates, which may be
associated with a plasmid containing other adenoviral
genes, which are not present in the helper virus. The
helper viruses described above may be further modified by
resort to adenovirus-polylysine conjugate technology.
See, e.g., Wu et al, J. Biol. Chem., 264:16985-16987
(1989); and K. J. Fisher and J. M. Wilson, Biochem. J.,
299:49 (April 1, 1994), incorporated herein by
reference.

Using this technology, a helper virus
containing preferably the late adenoviral genes is
modified by the addition of a poly-cation sequence
distributed around the capsid of the helper virus.
Preferably, the poly-cation is poly-lysine, which
attaches around the negatively-charged vector to form an
external positive charge. A plasmid is then designed to
express those adenoviral genes not present in the helper
virus, e.g., the E1, E2 and/or E4 genes. The plasmid
associates to the helper virus-conjugate through the
charges on the poly-lysine sequence. This modification
is also desirably made to a crippled helper virus of this
invention. This conjugate (also termed a trans-infection
particle) permits additional adenovirus genes to be
removed from the helper virus and be present on a plasmid
which does not become incorporated into the virus during
production of the recombinant viral vector. Thus, the
impact of contamination is considerably lessened.

III. Assembly of Shuttle Vector, Helper Virus and
Production of Recombinant Virus

The material from which the sequences used in the
pAdA shuttle vector and the helper viruses are derived,
as well as the various vector components and sequences
employed in the construction of the shuttle vectors,
helper viruses, and AdA viruses of this invention, are
obtained from commercial or academic sources based on
previously published and described materials. These
materials may also be obtained from an individual patient
or generated and selected using standard recombinant
molecular cloning techniques known and practiced by those
skilled in the art. Any modification of existing nucleic
acid sequences forming the vectors and viruses, including
sequence deletions, insertions, and other mutations are
also generated using standard techniques.
Assembly of the selected DNA sequences of the
adenovirus, and the reporter genes or therapeutic genes
and other vector elements into the pAdA shuttle vector
using conventional techniques is described in Example 1
below. Such techniques include conventional cloning
techniques of cDNA such as those described in texts
[Sambrook et al, cited above], use of overlapping
oligonucleotide sequences of the adenovirus genomes,
polymerase chain reaction, and any suitable method which
provides the desired nucleotide sequence. Standard
transfection and co-transfection techniques are employed,
e.g., CaPO4 transfection techniques using the HEK 293
cell line. Other conventional methods employed in this
invention include homologous recombination of the viral
genomes, plaquing of viruses in agar overlay, methods of
measuring signal generation, and the like. Assembly of
any desired AdA vector or helper virus of this invention
is within the skill of the art, based on the teachings of
this invention.

A. Shuttle Vector

As described in detail in Example 1 below and
with resort to Fig. 2A and the DNA sequence of the
plasmid reported in Fig. 3, a unique pAdA shuttle vector
of this invention, pAdA.CMVLacZ, is generated.
pAdA.CMVLacZ contains Ad5 sequences encoding the 5'
terminal sequence followed by a CMV promoter/enhancer, a splice
donor/splice acceptor sequence, a bacterial beta-galactosidase
gene (LacZ), a SV-40 poly A sequence (pA),
a 3' ITR from Ad5 and remaining plasmid sequence from
plasmid pSP72 (Promega) backbone.

To generate the AdA genome which is
incorporated in the vector, the plasmid pAdA.CMVLacZ
must be digested with EcoRI to release the AdA.CMVLacZ
genome, freeing the adenovirus ITRs and making them
available targets for replication. Thus production of
the vector is "restriction-dependent", i.e., requires
restriction endonuclease rescue of the replication
template. See Fig. 2B.
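
The restriction-dependent rescue step can be pictured as cutting a circular plasmid at its two flanking EcoRI sites, which yields one ITR-bounded genome fragment and one backbone fragment. The sketch below illustrates this on a toy sequence that is entirely hypothetical (it is not the sequence of pAdA.CMVLacZ); only the EcoRI recognition site GAATTC and its cut position after the first G are established facts.

```python
ECORI_SITE = "GAATTC"  # EcoRI cuts between G and AATTC


def ecori_cut_positions(seq):
    """Cut positions (0-based) on a circular sequence."""
    n = len(seq)
    # Append the start of the sequence so sites spanning the origin are found.
    search = seq + seq[:len(ECORI_SITE) - 1]
    cuts = []
    i = search.find(ECORI_SITE)
    while i != -1 and i < n:
        cuts.append((i + 1) % n)
        i = search.find(ECORI_SITE, i + 1)
    return sorted(cuts)


def circular_fragment_lengths(seq):
    """Fragment lengths after complete EcoRI digestion of a circle."""
    cuts = ecori_cut_positions(seq)
    n = len(seq)
    if not cuts:
        return [n]  # uncut circle
    return [(cuts[(k + 1) % len(cuts)] - c) % n or n
            for k, c in enumerate(cuts)]


# Toy plasmid: 20 nt standing in for the ITR-flanked genome and
# 10 nt standing in for the plasmid backbone (both hypothetical).
toy_plasmid = "GAATTC" + "A" * 20 + "GAATTC" + "C" * 10
print(ecori_cut_positions(toy_plasmid))
print(circular_fragment_lengths(toy_plasmid))
```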

A second type of pAdA plasmid was designed
which places the 3' Ad terminal sequence in a head-to-tail
arrangement relative to the 5' terminal sequence.
As described in Example 1 and Fig. 4A, and with resort
to the DNA sequence of the plasmid reported in Fig. 5, a
second unique AdA vector sequence of this invention,
AdAc.CMVLacZ, is generated from the shuttle plasmid
pAdAc.CMVLacZ, which contains an Ad5 5' ITR sequence and
3' ITR sequence positioned head-to-tail, followed by a
CMV enhancer/promoter, SD/SA sequence, LacZ gene and pA
sequence in a plasmid pSP72 (Promega) backbone. As
described in Example 1B, this "restriction-independent"
plasmid permits the AdA genome to be replicated and
rescued from the plasmid backbone without including an
endonuclease treatment (see Fig. 4B).
B. Helper Virus

As described in detail in Example 2, an
exemplary conventional E1 deleted adenovirus helper virus
is virus Ad.CBhpAP, which contains a 5' adenovirus
sequence from mu 0-1, a reporter minigene containing
human placenta alkaline phosphatase (hpAP) under the
transcriptional control of the chicken β-actin promoter,
followed by a poly-A sequence from SV40, followed by
adenovirus sequences from mu 9.2 to 78.4 and 86 to 100.
This helper contains deletions from mu 1.0 to 9.2 and
78.4 to 86, which eliminate substantially the E1 region
and the E3 region of the virus. This virus may be
desirably crippled according to this invention by
modifications to its packaging enhancer domain.
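
The deletions in Ad.CBhpAP and their effect on the E1 and E3 regions can be derived from the retained map-unit intervals. The following sketch uses the gene boundaries quoted earlier in this description (E1a mu 1.3-4.5, E1b mu 4.6-11.2, E3 mu 76.6-86.2); the function names are illustrative and the overlap fractions are simple interval arithmetic, not measurements.

```python
# Retained adenovirus segments of Ad.CBhpAP, in map units (from the text).
RETAINED = [(0.0, 1.0), (9.2, 78.4), (86.0, 100.0)]

# Gene boundaries in map units as quoted in this description.
GENES = {"E1a": (1.3, 4.5), "E1b": (4.6, 11.2), "E3": (76.6, 86.2)}


def deletions(retained, genome_end=100.0):
    """Intervals of the 0..genome_end map not covered by `retained`."""
    gaps, position = [], 0.0
    for start, end in sorted(retained):
        if start > position:
            gaps.append((position, start))
        position = max(position, end)
    if position < genome_end:
        gaps.append((position, genome_end))
    return gaps


def fraction_deleted(gene, gaps):
    """Fraction of a gene interval that falls inside the deletions."""
    g_start, g_end = gene
    removed = sum(max(0.0, min(g_end, d_end) - max(g_start, d_start))
                  for d_start, d_end in gaps)
    return removed / (g_end - g_start)


gaps = deletions(RETAINED)
print("deleted (mu):", gaps)
for name, interval in GENES.items():
    print(f"{name}: {fraction_deleted(interval, gaps):.0%} of the interval deleted")
```

On these figures E1a is removed entirely while E1b and E3 are removed in large part, in line with the statement that the deletions "eliminate substantially" the E1 and E3 regions.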

Exemplary crippled helper viruses of this
invention are generated using the techniques described in
Example 7 and contain the modified 5' PAC sequences,
i.e., adenovirus genome bp 1-269; m.u. 0-0.75 or
adenovirus genome bp 1-321; m.u. 0-0.89. Briefly, the 5'
sequences are modified by PCR and cloned by conventional
techniques into a conventional adenovirus based plasmid.
A hpAP minigene is incorporated into the plasmid, which
is then altered by homologous recombination with an E3
deleted adenovirus dl7001 to result in the modified
vectors so that the reporter minigene is followed on its
3' end with the adenovirus sequences mu 9.6 to 78.3 and
87 to 100.
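
The base pair and map unit figures quoted for the crippled helper viruses are related by a fixed conversion. The sketch below assumes 360 bp per map unit, inferred from this description equating Ad5 bp 1-360 with mu 0-1 (about 36 kb for 100 map units); the constant and function names are illustrative.

```python
# Assumption: one map unit corresponds to 360 bp, as implied by the
# description treating Ad5 bp 1-360 as map units 0-1.
BP_PER_MAP_UNIT = 360


def bp_to_mu(bp):
    """Convert an Ad5 base pair position to map units."""
    return bp / BP_PER_MAP_UNIT


def mu_to_bp(mu):
    """Convert map units to the nearest base pair position."""
    return round(mu * BP_PER_MAP_UNIT)


for bp in (269, 321, 360):
    print(f"bp 1-{bp}  ~  m.u. 0-{bp_to_mu(bp):.2f}")
```

Under this assumption bp 269 and bp 321 convert to 0.75 and 0.89 map units, matching the values given above.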

Generation of a poly-L-lysine conjugate helper
virus was demonstrated essentially as described in detail
in Example 5 below and Fig. 10 by coupling poly-L-lysine
to the Ad.CBhpAP virion capsid. Alternatively, the same
procedure may be employed with the PAC sequence modified
helper viruses of this invention.
C. Recombinant AdA Virus

As stated above, a pAdA shuttle vector in the
presence of helper virus and/or a packaging cell line
permits the adenovirus-transgene sequences in the shuttle
vector to be replicated and packaged into virion capsids,
resulting in the recombinant AdA virus. The current
method for producing such AdA virus is transfection-based
and described in detail in Example 3. Briefly, helper
virus is used to infect cells, such as the packaging cell
line human HEK 293, which are then subsequently
transfected with a pAdA shuttle vector containing a
selected transgene by conventional methods. About 30 or
more hours post-transfection, the cells are harvested,
and an extract prepared. The AdA viral genome is
packaged into virions that sediment at a lower density
than the helper virus in cesium gradients. Thus, the
recombinant AdA virus containing a selected transgene is
separated from the bulk of the helper virus by
purification via buoyant density ultracentrifugation in a
CsCl gradient.

The yield of AdA transducing virus is largely
dependent on the number of cells that are transfected
with the pAdA shuttle plasmid, making it desirable to use
a transfection protocol with high efficiency. One such
method involves use of a poly-L-lysinylated helper
adenovirus as described above. A pAdA shuttle plasmid
containing the desired transgene under the control of a
suitable promoter, as described above, is then complexed
directly to the positively charged helper virus capsid,
resulting in the formation of a single transfection
particle containing the pAdA shuttle vector and the
helper functions of the helper virus.

The underlying principle is that the helper
adenovirus coated with plasmid pAdA DNA will co-transport
the attached nucleic acid across the cell membrane and
into the cytoplasm according to its normal mechanism of
cell entry. Therefore, the poly-L-lysine modified helper
adenovirus assumes multiple roles in the context of an
AdA-based complex. First, it is the structural
foundation upon which plasmid DNA can bind, increasing the
effective concentration. Second, receptor mediated
endocytosis of the virus provides the vehicle for cell
uptake of the plasmid DNA. Third, the endosomalytic
activity associated with adenoviral infection facilitates
the release of internalized plasmid into the cytoplasm.
Fourth, the adenovirus contributes trans helper functions on
which the recombinant AdA virus is dependent for
replication and packaging of transducing viral particles.
The Ad-based transfection procedure using a pAdA shuttle
vector and a polycation-helper conjugate is detailed in
Example 6. Additionally, as described previously, the
helper virus-plasmid conjugate may be another form of
helper virus delivery of the omitted adenovirus genes not
present in the pAdA vector. Such a structure enables the
rest of the required adenovirus genes to be divided
between the plasmid and the helper virus, thus reducing
the self-replication efficiency of the helper virus.

A presently preferred method of producing the
recombinant AdA virus of this invention involves
performing the above-described transfection with the
crippled helper virus or crippled helper virus conjugate,
as described above. A "crippled" helper virus of this
invention is unable to package itself efficiently, and
therefore permits ready separation of the helper virus
from the newly packaged AdA vector of this invention by
use of buoyant density ultracentrifugation in a CsCl
gradient, as described in the examples below.

IV. Function of the Recombinant AdA Virus

Once the AdA virus of this invention is produced by
cooperation of the shuttle vector and helper virus, the
AdA virus can be targeted to, and taken up by, a selected
target cell. The selection of the target cell also
depends upon the use of the recombinant virus, i.e.,
whether or not the transgene is to be replicated in vitro
or ex vivo for production in a desired cell type for
redelivery into a patient, or in vivo for delivery to a
particular cell type or tissue. Target cells may be any
mammalian cell (preferably a human cell). For example,
in in vivo use, the recombinant virus can target to any
cell type normally infected by adenovirus, depending upon
the route of administration, i.e., it can target, without
limitation, neurons, hepatocytes, epithelial cells and
the like. The helper adenovirus sequences supply the
sequences necessary to permit uptake of the virus by the
AdA.

Once the recombinant virus is taken up by a cell,
the adenovirus flanked transgene is rescued from the
parental adenovirus backbone by the machinery of the
infected cell, as with other recombinant adenoviruses.
Once uncoupled (rescued) from the genome of the AdA
virus, the recombinant minigene seeks an integration site
in the host chromatin and becomes integrated therein,
either transiently or stably, providing expression of the
accompanying transgene in the host cell.

Use of the AdA Viruses in Gene Therapy 

The novel recombinant viruses and viral conjugates
of this invention provide efficient gene transfer
vehicles for somatic gene therapy. These viruses are
prepared to contain a therapeutic gene in place of the
LacZ reporter transgene illustrated in the exemplary
viruses and vectors. By use of the AdA viruses
containing therapeutic transgenes, these transgenes can
be delivered to a patient in vivo or ex vivo to provide
for integration of the desired gene into a target cell.
Thus, these viruses can be employed to correct genetic
deficiencies or defects. An example of the generation of
an AdA gene transfer vehicle for the treatment of cystic
fibrosis is described in Example 4 below. One of skill
in the art can generate any number of other gene transfer
vehicles by including a selected transgene for the
treatment of other disorders.

The recombinant viruses of the present invention may
be administered to a patient, preferably suspended in a
biologically compatible solution or pharmaceutically
acceptable delivery vehicle. A suitable vehicle includes
sterile saline. Other aqueous and non-aqueous isotonic
sterile injection solutions and aqueous and non-aqueous
sterile suspensions known to be pharmaceutically
acceptable carriers and well known to those of skill in
the art may be employed for this purpose.
The recombinant viruses of this invention may be
administered in sufficient amounts to transfect the
desired cells and provide sufficient levels of
integration and expression of the selected transgene to
provide a therapeutic benefit without undue adverse
effects or with medically acceptable physiological
effects which can be determined by those skilled in the
medical arts. Conventional and pharmaceutically
acceptable parenteral routes of administration include
direct delivery to the target organ, tissue or site,
intranasal, intravenous, intramuscular, subcutaneous,
intradermal and oral administration. Routes of
administration may be combined, if desired.

Dosages of the recombinant virus will depend
primarily on factors such as the condition being treated,
the selected gene, the age, weight and health of the
patient, and may thus vary among patients. A
therapeutically effective human dosage of the viruses of
the present invention is believed to be in the range of
from about 20 to about 50 ml of saline solution
containing concentrations of from about 1 x 10*' to 1 x
10^° pfu/ml virus of the present invention. A preferred
human dosage is about 20 ml saline solution at the above 
concentrations. The dosage will be adjusted to balance 
the therapeutic benefit against any side effects. The 
levels of expression of the selected gene can be 
monitored to determine the selection, adjustment or 
frequency of dosage administration.

The following examples illustrate the construction
of the pAdA shuttle vectors, helper viruses and
recombinant AdA viruses of the present invention and the
use thereof in gene therapy. These examples are
illustrative only, and do not limit the scope of the
present invention.

Example 1 - Production of pAdA.CMVLacZ and pAdAc.CMVLacZ

A. pAdA.CMVLacZ

A human adenovirus Ad5 sequence was modified to
contain a deletion in the E1a region [map units 1 to
9.2], which immediately follows the Ad 5' region (bp 1-360)
(illustrated in Fig. 1A). Thus, the plasmid
contains the 5' ITR sequence (bp 1-103), the native
packaging/enhancer sequences and the TATA box for the E1a
region (bp 104-360). A minigene containing the CMV
immediate early enhancer/promoter, an SD/SA sequence, a
cytoplasmic lacZ gene, and SV40 poly A (pA), was
introduced at the site of the E1a deletion. This
construct was further modified so that the minigene is
followed by the 3' ITR sequences (bp 35,353-end). The
DNA sequences for these components are provided in Fig. 3
and SEQ ID NO: 1 (see also the brief description of this
figure).

This construct was then cloned by conventional
techniques into a pSP72 vector (Promega) backbone to make
the circular shuttle vector pAdA.CMVLacZ. See the
schematic of Fig. 2A. This construct was engineered with
EcoRI sites flanking the 5' and 3' Ad5 ITR sequences.
pAdA.CMVLacZ was then subjected to enzymatic digestion
with EcoRI, releasing a linear fragment of the vector
spanning the terminal end of the Ad 5' ITR sequence
through the terminal end of the 3' ITR sequence from the
plasmid backbone. See Fig. 2B.

B. pAdAc.CMVLacZ

The shuttle vector pAdAc.CMVLacZ (Figs. 4A and
5) was constructed using a pSP72 (Promega) backbone so
that the Ad5 5' ITR and 3' ITR were positioned head-to-tail.
The organization of the Ad5 ITRs was based on
reports that suggest circular Ad genomes that have the
terminal ends fused together head-to-tail are infectious
to levels comparable to linear Ad genomes. A minigene
encoding the CMV enhancer, an SD/SA sequence, the LacZ
gene, and the poly A sequence was inserted immediately
following the 5' ITR. The DNA sequence of the resulting
plasmid and the sequences for the individual components
are reported in Fig. 5 and SEQ ID NO: 2 (see also the brief
description of Fig. 5). This plasmid does not require
enzymatic digestion prior to its use to produce the viral
particle (see Example 3). This vector was designed to
enable restriction-independent production of LacZ AdA
vectors.

Example 2 - Construction of a Helper Virus

The Ad.CBhpAP helper virus [K. Kozarsky et al, Somat.
Cell Mol. Genet., 19(5):449-458 (1993)] is a replication
deficient adenovirus containing an alkaline phosphatase
minigene. Its construction involved conventional cloning
and homologous recombination techniques. The adenovirus
DNA substrate was extracted from CsCl purified dl7001
virions, an Ad5 (serotype subgroup C) variant that
carries a 3 kb deletion between mu 78.4 through 86 in the
nonessential E3 region (provided by Dr. William Wold,
Washington University, St. Louis, Missouri). Viral DNA
was prepared for co-transfection by digestion with ClaI
(adenovirus genomic bp position 917) which removes the
left arm of the genome encompassing adenovirus map units
0-2.5. See lower diagram of Fig. 1B.

A parental cloning vector, pAd.BglII, was designed.
It contains two segments of wild-type Ad5 genome (i.e.,
map units 0-1 and 9-16.1) separated by a unique BglII
cloning site for insertion of heterologous sequences.
The missing Ad5 sequences between the two domains
(adenovirus genome bp 361-3327) result in the deletion
of E1a and the majority of E1b following recombination
with viral DNA.

A recombinant hpAP minigene was designed and
inserted into the BglII site of pAd.BglII to generate the
complementing plasmid, pAdCBhpAP. The linear arrangement
of this minigene includes:

(a) the chicken cytoplasmic β-actin promoter
[nucleotides +1 to +275 as described in T. A. Kost et al,
Nucl. Acids Res., 11(23):8287 (1983); nucleotides 9241-8684
of Fig. 7];

(b) an SV40 intron (e.g., nucleotides 1579-1711 of
SEQ ID NO: 2);

(c) the sequence for human placental alkaline
phosphatase (available from Genbank); and

(d) an SV40 polyadenylation signal (a 237 bp BamHI-BclI
restriction fragment containing the cleavage/poly-A
signals from both the early and late transcription units;
e.g., nucleotides 837-639 of SEQ ID NO: 1).

25 The resulting complementing plasmid, pAdCBhpAP 

contained a single copy of recombinant hpAP minigene 
flanked by adenovirus coordinates 0-1 on one side and 
9.2-16.1 on the other. 

Plasmid DNA was linearized using a unique NheI site
immediately 5' to adenovirus map unit zero (0), and the
above-identified adenovirus substrate and the
complementing plasmid DNAs were transfected into 293 cells
[ATCC CRL1573] using a standard calcium phosphate
transfection procedure [see, e.g., Sambrook et al, cited
above]. The end result of homologous recombination
involving sequences that map to adenovirus map units 9-
16.1 is the hybrid Ad.CBhpAP helper virus, which contains
adenovirus map units 0-1 and, in place of the E1a and E1b
coding regions from the dl7001 adenovirus substrate, the
hpAP minigene from the plasmid, followed by Ad
sequences 9 to 100, with a deletion in the E3 (78.4-86
mu) region.

Example 3 - Production of Recombinant AdA Virus

The recombinant AdA viruses of this invention are
generated by co-transfection of a shuttle vector with the
helper virus in a selected packaging or non-packaging
cell line.

As described in detail below, the linear fragment
provided in Example 1A, or the circular AdA genome
carrying the LacZ of Example 1B, is packaged into the
Ad.CBhpAP helper virus (Example 2) using conventional
techniques, which provides an empty capsid head, as
illustrated in Fig. 2C. Those virus particles which have
successfully taken up the pAd shuttle genome into the
capsid head can be distinguished from those containing
the hpAP gene by virtue of the differential expression of
LacZ and hpAP.

In more detail, 293 cells (4 x 10^7 293 cells/150
mm dish) were seeded and infected with helper virus
Ad.CBhpAP (produced as described in Example 2) at an MOI
of 5 in 20 ml DMEM/2% fetal bovine serum (FBS). This
helper specific marker is critical for monitoring the
level of helper virus contamination in AdA preparations
before and after purification. The helper virus provides
in trans the necessary helper functions for synthesis and
packaging of the AdACMVLacZ genome.

Two hours post infection, using either the
restriction-dependent shuttle vector or the restriction-
independent shuttle vector, plasmid pAdA.CMVLacZ
(digested with EcoRI) or pAdAc.CMVLacZ DNA, each carrying
a LacZ minigene, was added to the cells by a calcium
phosphate precipitate (2.5 ml calcium phosphate
transfection cocktail containing 50 µg plasmid DNA).
Thirty to forty hours post-transfection, cells were
harvested, suspended in 10 mM Tris-Cl (pH 8.0) (0.5
ml/150 mm plate) and frozen at -80°C. Frozen cell
suspensions were subjected to three rounds of freeze
(ethanol-dry ice)-thaw (37°C) cycles to release virion
capsids. Cell debris was removed by centrifugation
(5,000 x g for 10 minutes) and the clarified supernatant
applied to a CsCl gradient to separate recombinant virus
from helper virus as follows.

Supernatants (10 ml) applied to the discontinuous
CsCl gradient (composed of equal volumes of CsCl at 1.2
g/ml, 1.36 g/ml, and 1.45 g/ml in 10 mM Tris-Cl (pH 8.0))
were centrifuged for 8 hours at 72,128 x g, resulting in
separation of infectious helper virus from incompletely
formed virions. Fractions were collected from the
interfacing zone between the helper and top components
and analyzed by Southern blot hybridization or for the
presence of LacZ transducing particles. For functional
analysis, aliquots (2.0 ml from each sample) from the
same fractions were added to monolayers of 293 cells (in
35 mm wells) and expression of recombinant
β-galactosidase determined 24 hours later. More
specifically, monolayers were harvested, suspended in 0.3
ml 10 mM Tris-Cl (pH 8.0) buffer and an extract prepared
by three rounds of freeze-thaw cycles. Cell debris was
removed by centrifugation and the supernatant tested for
β-galactosidase (LacZ) activity according to the
procedure described in J. Price et al, Proc. Natl. Acad.
Sci., USA, 84:156-160 (1987). The specific activity
(milliunits β-galactosidase/mg protein) of reporter
enzymes was measured from indicator cells. For the
recombinant virus, specific activity was 116.

Fractions with β-galactosidase activity from the
discontinuous gradient were sedimented through an
equilibrium cesium gradient to further enrich the
preparation for AdA virus. A linear gradient was
generated in the area of the recombinant virus spanning
densities 1.29 to 1.34 gm/ml. A sharp peak of the
recombinant virus, detected as the appearance of the β-
gal activity in infected 293 cells, eluted between 1.31
and 1.33 gm/ml. This peak of recombinant virus was
located between two major A260 nm absorbing peaks and in
an area of the gradient where the helper virus was
precipitously dropping off. The equilibrium
sedimentation gradient accomplished another 10^2 to 10^3
fold purification of recombinant virus from helper virus.
The yield of recombinant AdA.CMVLacZ virus recovered from
a 50 plate prep after 2 sedimentations ranged from 10^7 to
10^8 transducing particles.
Analysis of lysates of cells transfected with the
recombinant vector and infected with helper revealed
virions capable of transducing the recombinant minigene
contained within the vector. Subjecting aliquots of the
fractions to Southern analysis using probes specific to
the recombinant virus or helper virus revealed packaging
of multiple molecular forms of vector derived sequence.
The predominant form of the deleted viral genome was the
size (~5.5 kb) of the corresponding double stranded DNA
monomer (AdA.CMVLacZ) with less abundant but discrete
higher molecular weight species (~10 kb and ~15 kb) also
present. Full-length helper virus is 35 kb. Importantly,
the peak of vector transduction activity corresponds with
the highest molecular weight form of the deleted virus.
These results confirm the hypothesis that ITRs and
contiguous packaging sequence are the only elements
necessary for incorporation into virions. An apparently
ordered or preferred rearrangement of the recombinant Ad
monomer genome leads to a more biologically active
molecule. The fact that larger molecular species of the
deleted genome are 2-fold and 3-fold larger than the monomer
deleted virus genome suggests that the rearrangements may
involve sequential duplication of the original genome.
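
The relationship between the observed species and the monomer can be illustrated with a small calculation. This is only a sketch: the sizes are the approximate values read from the Southern analysis above, and the helper name `copy_number` is hypothetical.

```python
MONOMER_KB = 5.5  # approximate size of the AdA.CMVLacZ monomer, from the text


def copy_number(species_kb: float, monomer_kb: float = MONOMER_KB) -> int:
    """Nearest whole number of monomer copies that accounts for a species."""
    return round(species_kb / monomer_kb)


if __name__ == "__main__":
    # Approximate sizes of the packaged species reported in the text.
    for size_kb in (5.5, 10.0, 15.0):
        print(f"~{size_kb:g} kb species: about {copy_number(size_kb)} copies")
```

The approximate sizes map to one, two and three copies of the monomer, which is the pattern expected if the larger species arise by duplication of the original genome.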

These same procedures may be adapted for production
of a recombinant AdA virus using a crippled helper virus
or helper virus conjugate as described previously.

Example 4 - Recombinant AdA Virus Containing a
Therapeutic Minigene

To test the versatility of the recombinant AdA virus
system, the reporter LacZ minigene obtained from
pAdACMVLacZ was cassette replaced with a therapeutic
minigene encoding CFTR.

The minigene contained human CFTR cDNA [Riordan et
al, Science, 245:1066-1073 (1989); nucleotides 8622-4065
of SEQ ID NO: 3] under the transcriptional control of a
chimeric CMV enhancer/chicken β-actin promoter element
(nucleotides +1 to +275 as described in T. A. Kost et al,
Nucl. Acids Res., 11(23):8287 (1983); nucleotides 9241-
8684 of SEQ ID NO: 3, Fig. 7); and followed by an SV-40
poly-A sequence (nucleotides 3887-3684 of SEQ ID NO: 3,
Fig. 7).

The CFTR minigene was inserted into the E1 deletion
site of an Ad5 virus (called pAd.E1A) which contains a
deletion in E1a from mu 1-9.2 and a deletion in E3 from
mu 78.4-86.

The resulting shuttle vector called pAdA.CBCFTR (see
Fig. 6 and the DNA sequence of Fig. 7 [SEQ ID NO: 3])
used the same Ad ITRs of pAdACMVLacZ, but the Ad5
sequences terminated with NheI sites instead of EcoRI.
Therefore release of the minigene from the plasmid was
accomplished by digestion with NheI.

The vector production system described in Example 3
was employed, using the helper virus Ad.CBhpAP (Example
2). Monolayers of 293 cells grown to 80-90% confluency
in 150 mm culture dishes were infected with the helper
virus at an MOI of 5. Infections were done in DMEM
supplemented with 2% FBS at 20 ml media/150 mm plate.
Two hours post-infection, 50 µg plasmid DNA in 2.5 ml
transfection cocktail was added to each plate and evenly
distributed.

Delivery of the pAdA.CBCFTR plasmid to 293 cells was
mediated by formation of a calcium phosphate precipitate
and AdA.CBCFTR virus resolved from Ad.CBhpAP helper virus
by CsCl buoyant density ultracentrifugation as follows:
Cells were left in this condition for 10-14 h,
after which the infection/transfection media was replaced
with 20 ml fresh DMEM/2% FBS. Approximately 30 h post-
transfection, cells were harvested, suspended in 10 mM
Tris-Cl (pH 8.0) buffer (0.5 ml/150 mm plate), and stored
at -80°C.

Frozen cell suspensions were lysed by three
sequential rounds of freeze (ethanol-dry ice)-thaw
(37°C). Cell debris was removed by centrifugation (5,000
x g for 10 min) and 10 ml clarified extract layered onto
a CsCl step gradient composed of three 9.0 ml tiers with
densities 1.45 g/ml, 1.36 g/ml, and 1.20 g/ml CsCl in 10
mM Tris-Cl (pH 8.0) buffer. Centrifugation was performed
at 20,000 rpm in a Beckman SW-28 rotor for 8 h at 4°C.
Fractions (1.0 ml) were collected from the bottom of the
centrifuge tube and analyzed for rAAd transducing
vectors. Peak fractions were combined and banded to
equilibrium. Fractions containing transducing virions
were dialyzed against 20 mM HEPES (pH 7.8)/150 mM NaCl
(HBS) and stored frozen at -80°C in the presence of 10%
glycerol or as a liquid stock at -20°C (HBS+40%
glycerol).
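
The rotor speed given for the step gradient can be related to the relative centrifugal force quoted in Example 3 through the standard conversion RCF = 1.118 x 10^-5 x r x rpm^2, with r in centimeters. The sketch below assumes a maximum radius of about 16.1 cm for the SW-28 rotor. That radius is not stated in this specification and should be checked against the rotor documentation.

```python
def rcf(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (multiples of g) at the given radius."""
    return 1.118e-5 * radius_cm * rpm ** 2


# Assumed maximum radius of the SW-28 rotor in cm; not given in the text.
SW28_RMAX_CM = 16.1

if __name__ == "__main__":
    print(f"20,000 rpm at r = {SW28_RMAX_CM} cm: "
          f"{rcf(20_000, SW28_RMAX_CM):,.0f} x g")
```

With this assumed radius, 20,000 rpm corresponds to roughly 72,000 x g, which is consistent with the 8 hour run at 72,128 x g described for the discontinuous gradient in Example 3.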

Fractions collected after ultracentrifugation were
analyzed for transgene expression and vector DNA. For
lacZ ArAd vectors, 2 µl aliquots were added to 293 cell
monolayers seeded in 35 mm culture wells. Twenty-four
hours later cells were harvested, suspended in 0.3 ml 10
mM Tris-Cl (pH 8.0) buffer, and lysed by three rounds of
freeze-thaw. Cell debris was removed by centrifugation
(15,000 x g for 10 min) and the supernatant assayed for
total protein [Bradford, (1976)] and β-galactosidase
activity [Sambrook et al, (1989)] using ONPG
(o-nitrophenyl β-D-galactopyranoside) as substrate.

Expression of CFTR protein from the AdA.CBCFTR
vector was determined by immunofluorescence localization.
Aliquots of AdA.CBCFTR, enriched by two rounds of
ultracentrifugation and exchanged to HBS storage buffer,
were added to primary cultures of airway epithelial cells
obtained from the lungs of CF transplant recipients.
Twenty-four hours after the addition of vector, cells
were harvested and affixed to glass slides using
centrifugal force (Cytospin 3, Shandon Scientific
Limited). Cells were fixed with freshly prepared 3%
paraformaldehyde in PBS (1.4 mM KH2PO4, 4.3 mM Na2HPO4,
2.7 mM KCl, and 137 mM NaCl) for 15 min at room
temperature (RT), washed twice in PBS, and permeabilized
with 0.05% NP-40 for 10 min at RT. The
immunofluorescence procedure began with a blocking step
in 10% goat serum (PBS/GS) for 1 h at RT, followed by
binding of the primary monoclonal mouse anti-human CFTR
(R-domain specific) antibody (Genzyme) diluted 1:500 in
PBS/GS for 2 h at RT. Cells were washed extensively in
PBS/GS and incubated for 1 h at RT with a donkey anti-
mouse IgG (H+L) FITC conjugated
antibody (Jackson ImmunoResearch Laboratories) diluted
1:100 in PBS/GS.

For Southern analysis of vector DNA, 5 µl aliquots
were taken directly from CsCl fractions and incubated
with 20 µl capsid digestion buffer (50 mM Tris-Cl, pH
8.0; 1.0 mM EDTA, pH 8.0; 0.5% SDS, and 1.0 mg/ml
Proteinase K) at 50°C for 1 h. The reactions were
allowed to cool to RT, loading dye was added, and
the samples were electrophoresed through a 1.2% agarose
gel. Resolved DNAs were electroblotted onto a nylon
membrane (Hybond-N) and hybridized with a 32P-labeled
restriction fragment. Blots were analyzed by
autoradiography or scanned on a PhosphorImager 445 SI
(Molecular Dynamics).

The results that were obtained from Southern blot
analysis of gradient fractions revealed a distinct viral
band that migrated faster than the helper Ad.CBhpAP DNA.
The highest viral titers mapped to fractions 3 and 4.
Quantitation of the bands in fraction 4 indicated the
titer of Ad.CBhpAP was approximately 1.5x greater than
AdACBCFTR. However, if the size difference between the
two viruses is factored in (Ad.CBhpAP=35 kb;
AdACBCFTR=6.2 kb), the viral titer (where 1 particle=1
DNA molecule) of AdACB.CFTR is at least 4-fold greater
than the viral titer of Ad.CBhpAP.
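
The size correction described above amounts to dividing each hybridization signal by genome length before comparing the two viruses. A minimal sketch follows, assuming the band intensity is proportional to the mass of DNA in the band. The function name is hypothetical and the inputs are the rounded values quoted in the text.

```python
def particle_ratio(vector_kb: float, helper_kb: float,
                   helper_signal_over_vector: float) -> float:
    """Vector:helper particle ratio from relative DNA-mass signals.

    Assumes signal is proportional to (number of genomes x genome length),
    so the number of genomes is proportional to signal / length.
    """
    vector_particles = 1.0 / vector_kb
    helper_particles = helper_signal_over_vector / helper_kb
    return vector_particles / helper_particles


# Values quoted in the text: 6.2 kb vector, 35 kb helper, helper signal 1.5x.
ratio = particle_ratio(vector_kb=6.2, helper_kb=35.0,
                       helper_signal_over_vector=1.5)

if __name__ == "__main__":
    print(f"vector:helper particle ratio is about {ratio:.1f}")
```

With these rounded inputs the ratio comes out near 3.8, close to the 4-fold excess of vector particles stated above. The exact figure depends on the measured band intensities, which are not reproduced here.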
While Southern blot analysis of gradient fractions
was useful for showing the production of AdA viral
particles, it also demonstrated the utility of
ultracentrifugation for purifying AdA viruses.
Considering the latter of these, both LacZ and CFTR
transducing viruses banded in CsCl to an intermediate
density between infectious adenovirus helper virions
(1.34 g/ml) and incompletely formed capsids (1.31 g/ml).
The lighter density relative to helper virus likely
results from the smaller genome carried by the AdA
viruses. This further suggests that changes in virus size
influence the density and purification of AdA virus.
Regardless, the ability to separate AdA virus from the
helper virus is an important observation and suggests
further purification may be achieved by successive rounds
of banding through CsCl.

This recombinant virus is useful in gene therapy 
alone, or preferably, in the form of a conjugate prepared 
as described herein. 

Example 5 - Correction of Genetic Defect in CF Airway
Epithelial Cells with AdACB.CFTR

Treatment of cystic fibrosis, utilizing the
recombinant virus provided above, is particularly suited
for in vivo, lung-directed, gene therapy. Airway
epithelial cells are the most desirable targets for gene
transfer because the pulmonary complications of CF are
usually its most morbid and life-limiting.

The recombinant AdACB.CFTR virus was fractionated on
sequential CsCl gradients and fractions containing CFTR
sequences, migrating between the adenovirus and top
components fractions described above, were used to infect
primary cultures of human airway epithelial cells derived
from the lungs of a CF patient. The cultures were
subsequently analyzed for expression of CFTR protein by
immunocytochemistry. Immunofluorescent detection with
mouse anti-human CFTR (R domain specific) antibody was
performed 24 hours after the addition of the recombinant
virus. Analysis of mock infected CF cells failed to
reveal significant binding to the R domain specific CFTR
antibody. Primary airway epithelium cultures exposed to
the recombinant virus demonstrated high levels of CFTR
protein in 10-20% of the cells.

Thus, the recombinant virus of the invention,
containing the CFTR gene, may be delivered directly into
the airway, e.g. by formulating the virus above into a
preparation which can be inhaled. For example, the
recombinant virus or conjugate of the invention
containing the CFTR gene is suspended in 0.25 molar
sodium chloride. The virus or conjugate is taken up by
respiratory airway cells and the gene is expressed.

Alternatively, the virus or conjugates of the
invention may be delivered by other suitable means,
including site-directed injection of the virus bearing
the CFTR gene. In the case of CFTR gene delivery,
preferred solutions for bronchial instillation are
sterile saline solutions containing in the range of from
about 1 x 10^7 to 1 x 10^10 pfu/ml, more particularly, in
the range of from about 1 x 10^8 to 1 x 10^9 pfu/ml of the
virus of the present invention.

Other suitable methods for the treatment of cystic
fibrosis by use of gene therapy recombinant viruses of
this invention may be obtained from the art discussions
of other types of gene therapy vectors for CF. See, for
example, U. S. Patent No. 5,240,846, incorporated by
reference herein.



Example 6 - Synthesis of Polycation Helper Virus
Conjugate

Another version of the helper virus of this
invention is a polylysine conjugate which enables the
pAdA shuttle plasmid to complex directly with the helper
virus capsid. This conjugate permits efficient delivery
of the pAdA shuttle vector in tandem with the
helper virus, thereby removing the need for a separate
transfection step. See Fig. 10 for a diagrammatic
outline of this construction. Alternatively, such a
conjugate with a plasmid supplying some Ad genes and the
helper supplying the remaining necessary genes for
production of the AdA viral vector provides a novel way
to reduce contamination of the helper virus, as discussed
above.

Purified stocks of a large-scale expansion of
Ad.CBhpAP were modified by coupling poly-L-lysine to the
virion capsid essentially as described by K. J. Fisher
and J. M. Wilson, Biochem. J., 299:49-58 (1994),
resulting in an Ad.CBhpAP-(Lys)n conjugate. The
procedure involves three steps.

First, CsCl band purified helper virus Ad.CBhpAP was
reacted with the heterobifunctional crosslinker sulfo-
SMCC [sulfo-(N-succinimidyl-4-(N-maleimidomethyl)
cyclohexane-1-carboxylate)] (Pierce). The conjugation
reaction, which contained 0.5 mg (375 nmol) of sulpho-
SMCC and 6 x 10^^ A260 helper virus particles in 3.0 ml of
HBS, was incubated at 30°C for 45 minutes with constant
gentle shaking. This step involved formation of a
peptide bond between the active N-hydroxysuccinimide
(NHS) ester of sulpho-SMCC and a free amine (e.g. lysine)
contributed by an adenovirus protein sequence (capsid
protein) in the vector, yielding a maleimide-activated
viral particle. The activated adenovirus is shown in
Fig. 10 having the capsid protein fiber labeled with the
nucleophilic maleimide moiety. In practice, other capsid
polypeptides including hexon and penton base are also
targeted.

Unincorporated, unreacted cross-linker was removed
by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG (Bio-
Rad Laboratories) column equilibrated with 50 mM Tris/HCl
buffer, pH 7.0, and 150 mM NaCl. Peak A260 fractions
containing maleimide-activated helper virus were combined
and placed on ice.

Second, poly-L-lysine having a molecular mass of 58
kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH 8.0),
150 mM NaCl and 1 mM EDTA was thiolated with 2-
iminothiolane/HCl (Traut's Reagent; Pierce) to a molar
ratio of 2 moles -SH/mole polylysine under N2; the cyclic
thioimidate reacts with the poly(L-lysine) primary amines
resulting in a thiolated polycation. After a 45 minute
incubation at room temperature the reaction was applied
to a 1 cm x 15 cm Bio-Gel P6DG column equilibrated with
50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM EDTA
to remove unincorporated Traut's Reagent.

Quantification of free thiol groups was accomplished
with Ellman's reagent [5,5'-dithio-bis-(2-nitrobenzoic
acid)], revealing approximately 3-4 mol of -SH/mol of
poly(L-lysine). The coupling reaction was initiated by
adding 1 x 10^^ A260 particles of maleimide-activated
helper virus/mg of thiolated poly(L-lysine) and
incubating the mixture on ice at 4°C for 15 hours under
argon. 2-mercaptoethylamine was added at the completion
of the reaction and incubation carried out at room
temperature for 20 minutes to block unreacted maleimide
sites.

Virus-polylysine conjugates, Ad.CPAP-p(Lys)n, were
purified away from unconjugated poly(L-lysine) by
ultracentrifugation through a CsCl step gradient with an
initial composition of equal volumes of 1.45 g/ml (bottom
step) and 1.2 g/ml (top step) CsCl in 10 mM Tris/HCl
buffer (pH 8.0). Centrifugation was at 90,000 g for 2
hours at 5°C. The final product was dialyzed against 20
mM Hepes buffer (pH 7.8) containing 150 mM NaCl (HBS).

Example 7 - Formation of AdA/helper-pLys Viral Particle

The formation of the Ad.CBhpAP-pLys/pAdA.CMVLacZ
particle is initiated by adding 20 µg plasmid
pAdA.CMVLacZ DNAs to 1.2 x lo" A260 particles Ad.CBhpAP-
pLys in a final volume of 0.2 ml DMEM and allowing the
complex to develop at room temperature for between 10-15
minutes. This ratio typically represents the plasmid DNA
binding capacity of a standard lot of adenovirus-pLys
conjugate and gives the highest levels of plasmid
transgene expression.

The resulting trans-infection particle is
transfected onto 293 cells (4 x 10^7 cells seeded on a 150
mm dish). Thirty hours after transfection, the particles
are recovered and subjected to a freeze/thaw technique to
obtain an extract. The extract is purified on a CsCl
step gradient with gradients at 1.20 g/ml, 1.36 g/ml and
1.45 g/ml. After centrifugation at 90,000 x g for 8
hours, the AdA vectors were obtained from a fraction
under the top components as identified by the presence of
LacZ, and the helper virus was obtained from a smaller,
denser fraction, as identified by the presence of hpAP.

Example 8 - Construction of Modified Helper Viruses with
Crippled Packaging (PAC) Sequences

This example refers to Figs. 9A through 9C, 10A and
10B in the design of modified helper viruses of this
invention.

Ad5 5' terminal sequences that contained PAC domains
I and II (Fig. 8A) or PAC domains I, II, III, and IV
(Fig. 8B) were generated by PCR from the wild type Ad5 5'
genome depicted in Fig. 1B using PCR clones indicated by
the arrows in Fig. 1B. The sequences of the resulting
amplification products (Fig. 8A and 8B) differed from the
wild-type Ad5 genome in the number of A-repeats carried
by the left (5') end.

As depicted in Fig. 8C, these amplification products
were subcloned into the multiple cloning site of
pAd.Link.1 (IHGT Vector Core). pAd.Link.1 is an
adenovirus based plasmid containing adenovirus m.u. 9.6
through 16.1. The insertion of the modified PAC regions
into pAd.Link.1 generated two vectors, pAd.PACII
(containing PAC domains I and II) and pAd.PACIV
(containing PAC domains I, II, III, and IV).

Thereafter, as depicted in Figs. 10A and 10B, for
each of these plasmids, a human placenta alkaline
phosphatase reporter minigene containing the immediate
early CMV enhancer/promoter (CMV), human placenta
alkaline phosphatase cDNA (hpAP), and SV40
polyadenylation signal (pA), was subcloned into each PAC
vector, generating pAd.PACII.CMVhpAP and
pAd.PACIV.CMVhpAP, respectively.

These plasmids were then used as substrates for
homologous recombination with dl7001 virus, described
above, by co-transfection into 293 cells. Homologous
recombination occurred between the adenovirus map units
9-16 of the plasmid and the crippled Ad5 virus. The
results of homologous recombination were helper viruses
containing Ad5 5' terminal sequences that contained PAC
domains I and II or PAC domains I, II, III, and IV,
followed by the minigene, and Ad5 3' sequences 9.6-78.3
and 87-100. Thus, these crippled viruses are deleted of
the E1 gene and the E3 gene.
The plaque formation characteristics of the PAC
helper viruses gave an immediate indication that the PAC
modifications diminished the rate and extent of growth.
Specifically, PAC helper virus plaques did not develop
until day 14-21 post-transfection, and on maturation
remained small. From previous experience, plaques of a
standard first generation Ad.CBhpAP helper virus with a
complete left terminal sequence would begin to develop by
day 7 and mature by day 10.

Viral plaques were picked and suspended in 0.5 ml of
DMEM media. A small aliquot of the virus stock was used
to infect a fresh monolayer of 293 cells, which were
histochemically stained for recombinant alkaline
phosphatase activity 24 hours post-infection. Six of
eight Ad.PACIV.CMVhpAP (encodes A-repeats I-IV) clones
that were screened for transgene expression were
positive, while all three Ad.PACII.CMVhpAP clones that
were selected scored positive. The clones have been
taken through two rounds of plaque purification and are
currently being expanded to generate a working stock.
These crippled helper viruses are useful in the
production of the AdA virus particles according to the
procedures described in Example 3. They are
characterized by containing sufficient adenovirus genes
to permit the packaging of the shuttle vector genome, but
their crippled PAC sequences reduce their efficiency for
self-encapsidation. Thus fewer helper viruses are
produced in favor of more AdA recombinant viruses.
Purification of AdA virus particles from helper viruses
is facilitated in the CsCl gradient, which is based on
the weight of the respective viral particles. This
facility in purification is a decided advantage of the
AdA vectors of this invention in contrast to adenovirus
vectors having only E1 or smaller deletions. The AdA
vectors even with minigenes of up to about 15 kb are
significantly different in weight than wild type or other
adenovirus helpers containing many adenovirus genes.

Example 9 - AdA Vector Containing a Full-Length
Dystrophin Transgene

Duchenne muscular dystrophy (DMD) is a common X-
linked genetic disease caused by the absence of
dystrophin, a 427K protein encoded by a 14 kilobase
transcript. Lack of this important sarcolemmal protein
leads to progressive muscle wasting, weakness, and death.
One current approach for treating this lethal disease is
to transfer a functional copy of the dystrophin gene into
the affected muscles. For skeletal muscle, a
replication-defective adenovirus represents an efficient
delivery system.

According to the present invention, a recombinant
plasmid pAdA.CMVmdys was created which contains only the
Ad5 cis-elements (i.e., ITRs and contiguous packaging
sequences) and harbors the full-length murine dystrophin
gene driven by the CMV promoter. This plasmid was
generated as follows.

pSL1180 [Pharmacia Biotech] was cut with Not I,
filled in by Klenow, and religated thus ablating the Not
I site in the plasmid. The resulting plasmid is termed
pSL1180NN and carries a bacterial ori and Amp resistance
gene.

pAdA.CMVLacZ of Example 1 was cut with EcoRI,
klenowed, and ligated with the ApaI-cut pSL1180NN to form
pAdA.CMVLacZ (ApaI).

The 14 kb mouse dystrophin cDNA [sequences
provided in C. C. Lee et al, Nature, 349:334-336 (1991)]
was cloned in two large fragments using a lambda ZAP
cloning vector (Stratagene) and subsequently cloned into
the bluescript vector pSK- giving rise to the plasmid
pCCL-DMD. A schematic diagram of this vector is provided
in Fig. 11, which illustrates the restriction enzyme
sites.

pAdA.CMVLacZ (ApaI) was cut with NotI and the
large fragment gel isolated away from the lacZ cDNA.
pCCL-DMD was also cut with NotI, gel isolated and
subsequently ligated to the large NotI fragment of NotI
digested pAdA.CMVLacZ (ApaI). The sequences of the
resulting vector, pAdA.CMVmdys, are provided in Fig.
12A-12P [SEQ ID NO: 10].

This plasmid contains sequences from the left
end of the Ad5 encompassing bp 1-360 (5' ITR), a mouse
dystrophin minigene under the control of the CMV
promoter, and sequence from the right end of Ad5 spanning
bp 35353 to the end of the genome (3' ITR). The minigene
is followed by an SV-40 poly-A sequence similar to that
described for the plasmids described above.

The vector production system described herein is
employed. Ten 150 mm 293 plates are infected at about 90%
confluency with a reporter recombinant E1-deleted virus
Ad.CBhpAP at an MOI of 5 for 60 minutes at 37°C. These
cells are transfected with pAdA.CMVmDys by calcium
phosphate co-precipitation using 50 µg linearized
DNA/dish for about 12-16 hours at 37°C. Media is
replaced with DMEM + 10% fetal bovine serum.

Full cytopathic effect is observed and a cell lysate
is made by subjecting the cell pellet to freeze-thaw
procedures three times. The lysate is subjected to an
SW41 three tier CsCl gradient for 2 hours and a band
migrating between the helper adenovirus and incomplete
virus is detected.

Fractions are assayed on a 6 well plate containing
293 cells infected with 5 µl of fraction for 16-20 hours in
DMEM + 2% FBS. Cells are collected, washed with
phosphate buffered saline, and resuspended in 2 ml PBS.
200 µl of the 2 ml cell fractions is cytospun onto a slide.

The cells were subjected to immunofluorescence for
dystrophin as follows. Cells were fixed in lOH MeOH at
-20°C. The cells were exposed to a monoclonal antibody
specific for the carboxy terminus of human dystrophin
[NCL-DYS2; Novocastra Laboratories Ltd., UK]. Cells were
then washed three times and exposed to a secondary
antibody, i.e. 1:200 goat anti-mouse IgG in FITC.

The titer/fraction for seven fractions revealed in
the immunofluorescent stains was calculated by the
following formula and reported in Table 2 below.
DFU/field = (DFU/200 µl cells) x 10 = DFU/10* cells =
(DFU/5 µl viral fraction) x 20 = DFU/100 µl fraction.
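
The scaling in the formula above can be written out explicitly. The following sketch assumes that the volumes in the formula are microliters, that the 2 ml cell suspension from one well derives from a single 5 microliter inoculum of gradient fraction, and that the count is the number of dystrophin-positive cells on one cytospin slide. The function name is hypothetical.

```python
CELL_SUSPENSION_UL = 2000  # cells from one well resuspended in 2 ml PBS
CYTOSPIN_UL = 200          # volume of that suspension spun onto the slide
INOCULUM_UL = 5            # volume of gradient fraction added to the well
REPORT_UL = 100            # titers are reported per 100 ul of fraction


def dfu_per_100_ul(positive_cells_on_slide: int) -> float:
    """Scale a slide count to DFU per 100 ul of gradient fraction."""
    # The slide carries 200/2000 of the well, so multiply by 10.
    dfu_per_well = positive_cells_on_slide * (CELL_SUSPENSION_UL / CYTOSPIN_UL)
    # The well received 5 ul of fraction, so multiply by 20 to reach 100 ul.
    return dfu_per_well * (REPORT_UL / INOCULUM_UL)


if __name__ == "__main__":
    for count in (1, 30):
        print(f"{count} positive cell(s) -> {dfu_per_100_ul(count):g} DFU/100 ul")
```

Under these assumptions a single positive cell on the slide corresponds to 200 DFU per 100 microliters of fraction, which is the smallest non-zero value that appears in Table 2.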



Table 2

Fraction    Titer
1
2
3           6 x 10'
4           1.8 x 10*
5           9.6 x 10'
6           200
7           200

A virus capable of transducing the dystrophin
minigene is detected as a "positive" (i.e., green
fluorescent) cell. The results of the IF illustrate that
heat-treated fractions do not show positive
immunofluorescence. Southern blot data suggest one
species of the same size as the input DNA, with helper
virus contamination.

The recombinant virus can be subsequently separated
from the majority of helper virus by sedimentation
through cesium gradients. Initial studies demonstrate
that the functional AdCMVAmDys virions are produced, but
are contaminated with helper virus. Successful
purification would render AdA virions that are incapable
of encoding viral proteins but are capable of transducing
murine skeletal muscle.

Example 10 - Pseudotyping

The following experiment provides a method for
preparing a recombinant AdA according to the invention,
utilizing helper viruses from serotypes which differ from
that of the pAdA in the transfection/infection protocol.
It is unexpected that the ITRs and packaging sequence of
Ad5 could be incorporated into a virion of another
serotype.

A. Protocol 

The basic approach is to transfect the
AdΔ.CMVlacZ recombinant virus (Ad5) into 293 cells and
subsequently infect the cells with helper virus
derived from a variety of Ad serotypes (2, 3, 4, 5, 7, 8,
12, and 40). When CPE is achieved, the lysate is
harvested and banded through two cesium gradients.

More particularly, the Ad5-based plasmid
pAdΔ.CMVlacZ of Example 1 was linearized with EcoRI. The
linearized plasmids were then transfected into ten 150 mm
dishes of 293 cells using calcium phosphate
co-precipitation. At 10-15 hours post transfection, wild
type adenoviruses (of one of the following serotypes: 2,
3, 4, 5, 7, 12, 40) were used to infect cells at an MOI
of 5. The cells were then harvested at full CPE and
lysed by three rounds of freeze-thawing. The pellet was
resuspended in 4 mL Tris-HCl. Cell debris was removed by
centrifugation and partial purification of Ad5Δ.CMVlacZ
from helper virus was achieved with 2 rounds of CsCl
gradient centrifugation (SW41 column, 35,000 rpm, 2
hours). Fractions were collected from the bottom of the
tube (fraction #1) and analysed for lacZ transducing
viruses on 293 target cells by histochemical staining (at
20 h PI). Contaminating helper viruses were quantitated
by plaque assay.

Except for adenovirus type 3, infection with Ad
serotypes 2, 4, 5, 7, 12 and 40 was able to produce lacZ
transducing viruses. The peak of β-galactosidase
activity was detected between the two major A260
absorbing peaks, where most of the helper viruses banded
(data not shown). The quantity of lacZ virus recovered
from 10 plates ranged from 10^[illegible] to 10^[illegible]
transducing particles depending on the serotype of the helper.
As expected, Ad2 and Ad5 produced the highest titer of lacZ
transducing viruses (Table 3). Wild type contamination
was in general 10^[illegible]-10^[illegible] log higher than
the corresponding lacZ titer except in the case of Ad40.

B. Results

Table 3 summarizes the growth characteristics
of the wild type adenoviruses as evaluated on propagation
in 293 cells. This demonstrated the feasibility of
utilizing these helper viruses to infect the cell line
which has been transfected with the Ad5 deleted virus.

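Table 3

Adenovirus
serotype    p/ml                     pfu/ml                    p:pfu
2           5 x 10^[illegible]       2.5 x 10^[illegible]      20:1
3           1 x 10^[illegible]       6.25 x 10^[illegible]     160:1
4           3 x 10^[illegible]       2 x 10^[illegible]        150:1
5           1 x 10^[illegible]       5 x 10^[illegible]        20:1
7a          5 x 10^[illegible]       1 x 10^[illegible]        50:1
12          6 x 10^[illegible]       4 x 10^[illegible]        150:1
35          1.2 x 10^[illegible]
40          2.2 x 10^[illegible]     4.4 x 10^[illegible]      5000:1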
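
The p:pfu column of Table 3 is the ratio of physical particles to plaque-forming units. A minimal Python sketch of that ratio follows; the function name is hypothetical, and the example inputs are illustrative values chosen to give a 20:1 ratio, not exponents read from the table.

```python
def particle_to_pfu_ratio(particles_per_ml, pfu_per_ml):
    """Return the particle:pfu ratio for one virus preparation."""
    if pfu_per_ml <= 0:
        raise ValueError("pfu_per_ml must be positive")
    return particles_per_ml / pfu_per_ml


# Illustrative (assumed) titers: 5e12 particles/ml and 2.5e11 pfu/ml
ratio = particle_to_pfu_ratio(5e12, 2.5e11)
label = f"{ratio:.0f}:1"
```

A lower ratio indicates that a larger share of the physical particles is infectious in the plaque assay.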



Table 4 summarizes the results of the final
purified fractions. The middle column, labeled LFU/µl,
quantifies the production of lacZ forming units, which is
a direct measure of the packaging and propagation of
pseudotyped recombinant AdΔ virus. The pfu/µl titer is
an estimate of the contaminating wild type virus. AdΔ
virus pseudotyped with all adenoviral strains was
generated except for Ad3. The titers range between
10^[illegible] and 10^[illegible].

Table 4

Serotype    LFU/µl                   PFU/ml
2           4.6 x 10^[illegible]     1.8 x 10^[illegible]
3                                    NA
4           6.7 x 10^[illegible]     9.3 x 10^[illegible]
5           6.3 x 10^[illegible]     1.9 x 10^[illegible]
7a          3 x 10^[illegible]       1.8 x 10^[illegible]
12          1.2 x 10^[illegible]     3.3 x 10^[illegible]
40          9.5 x 10^[illegible]     1.5 x 10^[illegible]




Tables 5A-5D represent a more detailed analysis
of the fractions from the second purification for each of
the experiments summarized in Table 4. Again, LFU/µl is
the recovery of the AdΔ viruses, whereas pfu/µl
represents recovery of the helper virus.







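Table 5A

Ad2 Fraction #     VOLUME/µl   LFU/µl                    PFU/µl
1                  120         9532                      8 x 10^[illegible]
2                  100         5.8 x 10^[illegible]      3 x 10^[illegible]
3                  100         8.24 x 10^[illegible]     6 x 10^[illegible]
4                  100         9.47 x 10^[illegible]     1.2 x 10^[illegible]
5                  100         6 x 10^[illegible]        8 x 10^[illegible]
6                  100         2 x 10^[illegible]        6 x 10^[illegible]
7                  100         5434                      5 x 10^[illegible]
Total/10 plates                3.32 x 10^[illegible]     1.35 x 10^[illegible]
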
Table 5B

Ad4 Fraction #     VOLUME/µl   LFU/µl                    PFU/µl
1                  100         1000                      1.75 x 10^[illegible]
2                  100         1.79 x 10^[illegible]     2.8 x 10^[illegible]
3                  100         1.8 x 10^[illegible]      5.5 x 10^[illegible]
4                  100         2909                      1.25 x 10^[illegible]
5                  100         920                       4 x 10^[illegible]
6                  100         153                       3 x 10^[illegible]
Total/10 plates                4 x 10^[illegible]        5.6 x 10^[illegible]

Ad5 Fraction #     VOLUME/µl   LFU/µl                    PFU/µl
1                  120         1.98 x 10^[illegible]     6 x 10^[illegible]
2                  100         5.8 x 10^[illegible]      3 x 10^[illegible]
3                  100         1.2 x 10^[illegible]      1.5 x 10^[illegible]
4                  100         1 x 10^[illegible]        1.4 x 10^[illegible]
5                  100         7.96 x 10^[illegible]     8 x 10^[illegible]
6                  100         6860                      6 x 10^[illegible]
Total/10 plates                3.88 x 10^[illegible]     1.2 x 10^[illegible]

Table 5C

Ad7 Fraction #     VOLUME/µl   LFU/µl                    PFU/µl
1                  100         1225                      5 x 10^[illegible]
2                  100         5550                      4 x 10^[illegible]
3                  100         4938                      2 x 10^[illegible]
4                  100         3866                      8 x 10^[illegible]
5                  100         4134                      6 x 10^[illegible]
6                  100         995                       7 x 10^[illegible]
7                  100         230                       6 x 10^[illegible]
Total/10 plates                2.09 x 10^6               1.3 x 10^[illegible]

Ad12 Fraction #    VOLUME/µl   LFU/µl                    PFU/µl
1                  100         31                        5 x 10^[illegible]
2                  80          169                       8.5 x 10^[illegible]
3                  80          245                       1.8 x 10^[illegible]
4                  110         161                       1.1 x 10^[illegible]
5                  120         62                        7 x 10^[illegible]
Total/10 plates                6.14 x 10^4               1.65 x 10^[illegible]

Table 5D

Ad40 Fraction #    VOLUME/µl   LFU/µl                    PFU/µl
1                  80          61                        5
2                  80          184                       3
3                  80          199                       3
4                  80          168                       1
5                  80          122
6                  100         46
7                  100         32
Total/10 plates                6.65 x 10^4               1.1 x 10^[illegible]
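
The LFU totals in Tables 5A-5D appear to be the sum, over fractions, of fraction volume multiplied by titer. The Python sketch below checks that reading against the Ad40 fractions of Table 5D, whose LFU values are fully legible. The function name is hypothetical, and the interpretation of the total row is an assumption inferred from the numbers rather than stated in the text.

```python
def total_units(volumes_ul, titers_per_ul):
    """Sum volume x titer over all gradient fractions."""
    if len(volumes_ul) != len(titers_per_ul):
        raise ValueError("need one titer per fraction volume")
    return sum(v * t for v, t in zip(volumes_ul, titers_per_ul))


# Ad40 fractions 1-7 as listed in Table 5D
ad40_volumes_ul = [80, 80, 80, 80, 80, 100, 100]
ad40_lfu_per_ul = [61, 184, 199, 168, 122, 46, 32]

ad40_total_lfu = total_units(ad40_volumes_ul, ad40_lfu_per_ul)
```

The result, 66,520 LFU, rounds to the 6.65 x 10^4 listed in the total row. The same calculation on the Ad7 and Ad12 fractions of Table 5C gives about 2.09 x 10^6 and 6.14 x 10^4 respectively.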



C. Characterization of the Structure of Packaged
Viruses

Aliquots of serial fractions were analysed by
Southern blots using lacZ as a probe. In the case of Ad2
and Ad5, not only was the linearized monomer packaged, but
multiple forms of recombinant virus with distinct sizes
were found. These forms correlated well with the sizes
of dimers, trimers and other higher molecular weight
concatamers. The linearized monomers peaked closer to
the top of the tube (the defective adenovirus band) than
the other forms. When these forms were compared with lacZ
activity, lacZ activity correlated better with the
higher molecular weight forms than with the monomers. With
pseudotyping of Ad4 and Ad7, no linearized monomers were
packaged and only higher molecular weight forms were
found.

These data definitively demonstrate the
production and characterization of the Δ virus and the
different pseudotypes. This example illustrates a very
simple way of generating pseudotype viruses.

Example 11 - AdΔ Vector Containing a FH Gene

Familial hypercholesterolemia (FH) is an autosomal
dominant disorder caused by abnormalities (deficiencies)
in the function or expression of LDL receptors [M.S.
Brown and J.L. Goldstein, Science, 232(4746):34-37
(1986); J.L. Goldstein and M.S. Brown, "Familial
hypercholesterolemia" in Metabolic Basis of Inherited
Disease, ed. C.R. Scriver et al, McGraw Hill, New York,
pp1215-1250 (1989)]. Patients who inherit one abnormal
allele have moderate elevations in plasma LDL and suffer
premature life-threatening coronary artery disease (CAD).
Homozygous patients have severe hypercholesterolemia and
life-threatening CAD in childhood. An FH-containing
vector of the invention is constructed by replacing the
lacZ minigene in the pAdΔc.CMVlacZ vector with a minigene
containing the LDL receptor gene [T. Yamamoto et al,
Cell, 39:27-38 (1984)] using known techniques and as
described analogously for the dystrophin gene and CFTR in
the preceding examples. Vectors bearing the LDL receptor
gene can be readily constructed according to this
invention. The resulting plasmid is termed
pAdΔc.CMV-LDL.

This plasmid is useful in gene therapy of FH alone,
or preferably, in the form of a conjugate prepared as
described herein to substitute a normal LDL gene for the
abnormal allele responsible for the disease.
A. Ex Vivo Gene Therapy

Ex vivo gene therapy can be performed by
harvesting and establishing a primary culture of
hepatocytes from a patient. Known techniques may be used
to isolate and transduce the hepatocytes with the above
vector(s) bearing the LDL receptor gene(s). For example,
techniques of collagenase perfusion developed for rabbit
liver can be adapted for human tissue and used in
transduction. Following transduction, the hepatocytes
are removed from the tissue culture plates and reinfused
into the patient using known techniques, e.g. via a
catheter placed into the inferior mesenteric vein.
B. In Vivo Gene Therapy

Desirably, the in vivo approach to gene
therapy, e.g. liver-directed, involves the use of the
vectors and vector conjugates described above. A
preferred treatment involves infusing a vector LDL
conjugate of this invention into the peripheral
circulation of the patient. The patient is then
evaluated for change in serum lipids and liver tissues.

The virus or conjugate can be used to infect
hepatocytes in vivo by direct injection into a peripheral
or portal vein (10^[illegible]-10^[illegible] pfu/kg) or
retrograde into the biliary tract (same dose). This effects
gene transfer into the majority of hepatocytes.

Treatments are repeated as necessary, e.g.
weekly. Administration of a dose of virus equivalent to
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is
anticipated to lead to high level gene expression in the
majority of hepatocytes.
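
The relationship between MOI and dose stated above (MOI = pfu per target cell) can be written out as a one-line calculation. The Python sketch below is illustrative only: the function name is hypothetical and the cell count in the example is an arbitrary placeholder, not a figure from this specification.

```python
def pfu_for_moi(moi, target_cell_count):
    """Return the number of pfu needed to reach a given MOI (pfu per cell)."""
    if moi < 0 or target_cell_count < 0:
        raise ValueError("moi and target_cell_count must be non-negative")
    return moi * target_cell_count


# Placeholder example: 1e6 cells at an MOI of 20 (cell count is an assumption)
example_pfu = pfu_for_moi(20, 1_000_000)
```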

All references recited above are incorporated herein
by reference. Numerous modifications and variations of
the present invention are included in the
above-identified specification and are expected to be obvious
to one of skill in the art. Such modifications and
alterations to the compositions and processes of the
present invention, such as various modifications to the
PAC sequences or the shuttle vectors, or to other
sequences of the vector, helper virus and minigene
components, are believed to be encompassed in the scope
of the claims appended hereto.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Trustees of the University of Pennsylvania

Wilson, James M.
Fisher, Krishna J.
Chen, Shu-Jen
Weitzman, Matthew

(ii) TITLE OF INVENTION: Improved Adenovirus and Methods
of Use Thereof

(iii) NUMBER OF SEQUENCES: 10

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Howson and Howson

(B) STREET: Spring House Corporate Cntr, PO Box 457

(C) CITY: Spring House

(D) STATE: Pennsylvania

(E) COUNTRY: USA

(F) ZIP: 19477

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/331,381

(B) FILING DATE: 28-OCT-1994

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Bak, Mary E.

(B) REGISTRATION NUMBER: 31,215

(C) REFERENCE/DOCKET NUMBER: GNVPN.008PCT

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 215-540-9200

(B) TELEFAX: 215-540-5818

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7897 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GAACTCGAGC AGCTGAA6CT T6AATTCCAT CATCAATAAT ATACCTTATT 50 

TTGGATTGAA GCCAATATGA TAATGAGGG6 GTGGA6TTTG TGAC6TGGCG 100 

CGG6GCGTGG GAACGGGGCG GGTGACGTAG 6TTTTAGGGC GGAGTAACTT 150 

6TATGTGTTG GGAATTGTAG TTTTCTTAAA ATGGGAAGTT ACGTAACGTG 200 

G6AAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT 250 

TTCTGGGCGT AGGTTCGCGT GCGGTTTTCT GG6TGTTTTT T6TGGACTTT 300 

AACCGTTACG TCATTTTTTA GTCCTATATA TACTC6CTCT GCACTTGGCC 350 

CTTTTTTACA CTGTGACTGA TTGAGCTGGT GCCGTGTC6A GTGGT6TTTT 400 

TTTAATAGGT ' IT' l'UT'm' T T ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 450 

6TGAAGC6CT GTAT6TTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG 500 

CAGGAGGGTT TTTCAGGT6T TTATGTGTTT TTCTCTCCTA TTAATTTTGT 550 

TATACCTCCT ATGGGGGCTG TAATGTTGTC TCTACGCCTG CGGGTATGTA 600 

TTCCCCCCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA 650 

ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 700 

GTTGTTAACT TGTTTATT6C AGCTTATAAT GGTTACAAAT AAAGCAATAG 750 

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG 800 

GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC 850 

GCCTAGAGTC GAGGCCGAGT TTGTCAGAAA GCA6ACCAAA CAGCGGTTGG 900 

AATAATAGCG AGAACAGAGA AATAGCGGCA AAAATAATAC CCGTATCACT 950 

TTTGCTGATA TGGTTGATGT CATGTAGCCA AATCGGGAAA AACGGGAAGT 1000 
AGGCTCCCAT 6ATAAAAAA6 TAAAAGAAAA
AAGTTTGTGT TTTTTAAATA GTACATAAT6 
GG6CA6ACAT G6CCTGCCC6 GTTATTATTA 
GGTAATG6TA 606ACC66C6 CTCA6CT6TA 
CTCCAGGAGT CGTC6CCACC AATCCCCATA 
CCATGTGCCT TCTTCCGCGT GCAGCAGATG 
GTT6CT6TTG ACT6TA60G6 CT6AT6TT6A 
TGGT6TGGGC CATAATTCAA TTC6C6CGTC 
GCTCGGGAAG ACGTACGG66 TATACATGTC 
6GTCAAAACA GGCGGCAGTA AGGCGGTCGG 
AATCC6A6CC AGTTTACCCG CTCTGCTACC 
6CCAATC06C 6CCGGATGC6 GT6TATCGCT 
TAATC6CCAT TT6ACCACTA CCATCAATCC 
AATAAG6TTT TCCCCTGAT6 CT6CGA0606 
CGCATCA6CA AGT6TATCTG CCGTGCACTG 
GGTAATG6CC CGCCGCCTTC CAGC6TTCGA 
06G6TC6CTT CACTTAC6CC AATGTC6TTA 
CT6ATC6C6C A6CGGC6TCA GCA6TT6TTT 
GTGAAAGAAA GCCTGACTGG CGGTTAAATT 
TC6AT6CAAA AATCCATTTC GCTGGTGGTC 
06C6GCG666 A6C6TCACAC T6AGGTTTTC 
AGGCGCT6AT GT6CCCG6CT TCT6ACCAT6 
AC6CGTACTG TGAGCCA6AG TTGCCC6GCG 
AGGCAGTTCA ATCyVACTGTT TACCTTGTGG 
CACCGCTTGC CAGCGGCTTA CCATCCAGCG 
TCGTTATCGC TATGACG6AA CAGGTATTCG 



AGAATAAACC 


GAACATCCAA 


1050 


GATTTCCTTA 


CGCGAAATAC 


1100 


TTTTTGACAC 


CAGACCAACT 


1150 


A.TCC6CCGA 


TACT6ACGGG 


1200 


TGGAAACCGT 


CGATATTCAG 


1250 


GCGATGGCTG 


CTTTCCATCA 


1300 


ACTG6AA6TC 


GCC606CCAC 


1350 


CCGCA60GCA 


GACCGTTTVC 


1400 


TGACAATGGC 


A6ATCCCAGC 


1450 


gatAgttttc 


TTGCGGCCCT 


1500 


TGCGCCAGCT 


GGCAGTTCAG 


1550 


OGCCACTTCA 


ACATCAA06G 


1600 


G6TAG6TTTT 


CC6GCT6ATA 


1650 


TGAGCGGTCG 


TAATCA6CAC 


1700 


CAACAACGCT 


GCTTCGGCCT 


1750 


CCCAGGC6TT 


AGGGTCAATG 


1800 


TCCAGCGGTG 


CACGGGTGAA 


1850 


TTTATC6CCA 


ATCCACATCT 


1900 


GCCAACGCTT 


ATTACCCA6C 


1950 


AGAT6CGGGA 




^ uuu 


C6CCAGACGC 


CACT6CTGCC 


2050 


C66TC6C6TT 


C66TTGCACT 


2100 


CTCTCCGGCT 


G0GGTA6TTC 


2150 


AGC6ACATCC 


A6AGGCACTT 


2200 


CCACCATCCA 


GTGCAG6AGC 


2250 


CTGGTCACTT 


CGATGGTTTG 


2300 
CCCGGATAAA CGGAACTGGA AAAACTGCTG
GCGCTGGATG CGGCGTGCGG TCGGCAAAGA 
TGGCGATCGT TCGGCGTATC 6CCAAAATCA 
GTTGCC6TTT TCATCATATT TAATCAGCGA 
CGAAGCCGCC CTGTAAAC6G GGATACTGAC 
GCGAAACCGC CAAGACTGTT ACCCATCGCG 
CAGCGGGCGC GTCTCTCCAG GTAGCGAAAG 
TCGGCACA6C CGG6AAGGGC TGGTCTTCAT 
CAAATAATAT CGGT6GCCGT GGTGTCGGCT 
CGGGCGGGAA GGATCGACAG ATTTGATCCA 
TAGCGCC6TG GCCTGATTCA TTCCCCAGCG 
TGATTACGAT C6CGCT6CAC CATTCGCGTT 
TAGCCAGC6C GGATCATC66 TCAGACGATT 
TTTCAATATT GGCTTCATCC ACCACATACA 
6TGTACCACA GCGGATGGTT CGGATAATGC 
GTTGTTCTGC TTCATCAGCA GGATATCCTG 
TGACCTGACC ATGCAGAGGA TGATGCTOGT 
AGCAACGGCT TGCCGTTCAG CAGCAGCAGA 
GCGGAAACCG ACATCGCAGG CTTCTGCTTC 
TGTGCAGTTC AACCACCGCA CGATAGAGAT 
AGTTTCGGGT TTTCGACGTT CAGACGTAGT 
ACCACGCTCA TCGATAATTT CACCGCC6AA 
CCTGCGTTTC ACCCTGCCAT AAAGAAACTG 
AACTCGCCGC ACATCTGAAC TTCAGCCTCC 
ATCATTAAAG CGAGTGGCAA CATGGAAATC 
TATGCAGCAA CGAGACGTCA CGGAAAATGC 



CTGGTGTTTT 
CCAGACCGTT 
CCGCCGTAAG 
CTsTATCCACC 
GAAACGCCTG 
TGGGCX5TATT 
CCATTTTTTG 
CCACGCGCGC 
CCGCCGCCTT 
60GATACAGC 
ACCAGATGAT 
ACGCGTTCGC 
CATTGGCACC 
GGCCGTAGCG 
GAACAGCGCA 
CACCATCGTC 
GACGGTTAAC 
CCATTTTCAA 
AATCAGCGTG 
TCGGGATTTC 
GTGACGCGAT 
AGGCGCGGTG 
TTACCCGTAG 
AGTACAGCGC 
GCTGATTTGT 
CGCTCATCCG 



GCTTCCGTCA 
CATACAGAAC 
CCGACCACGG 
CAGTCCCAGA 
CCAGTATTTA 
CGCAAAGGAT 
ATGGACCATT 
GTACATC6GG 
CATACT6CAC 
GCGTCGTGAT 
CACACTCGGG 
TCATCGCCGG 
ATGCCGT6GG 
GTCGCACA6C 
CGGCGTTAAA 
TGCTCATCCA 
GCCTCGAATC 
TCCGCACCTC 
CCGTCGGCGG 
GGCGCTCCAC 
CGGCATAACC 
CCGCTGGCGA 
GTAGTCAC6C 
GGCTGAAATC 
GTAGTCGGTT 
CCACATATCC 



2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 

3050 

3100 

3150 

3200 

3250 

3300 

3350 

3400 

3450 

3500 

3550 

3600 
T6ATCTTCCA GATAACTGCC GTCACTCCAA
6C66TTTTCT CCGGC6C6TA AAAATGC6CT 
AAC6ACT6TC CTGGCC6TAA CC6ACCCA6C 
AAC6CC6A6T TAAC6CCATC AAAAATAATT 
CCAGCTTTCA TCAACATTAA ATGTGAGCGA 
CC6TGGGAAC AAACG6CG6A TTGACC6TAA 
TA6AT666C6 CATC6TAACC 6T6CATCT6C 
A6TATC66CC TCA66AA6AT C6CACTCGA6 
CTGGT6CCG6 AAACCA66CA AAGCGCCATT 
TGTTGGGAAG GGC6ATCGGT GCGGGCCTCT 
CAAAGGGGGA T6TGCTGCAA 6GC6ATTAAG 
CCCAGTCAC6 ACGTTGtAAA AC6AC666AT 
CTGGT6TCCA 6ACCAAT6CC TCCCA6AC06 
GTTGGTCAAA GTAAAC6ACA TGGTGACTTC 
TTTCGATCCC C6GGAATTGC G6CCGCGGGT 
6CA6AA6TAA CACTTCCGTA CAG6CCTAGA 
AG6AGCA6TT CTTT6ATTT6 CACCACCACC 
AAGACAAAAA GACTAAACTT ACCAGTTAAC 
CGAGTACCGG ATCCTCTAGA GTCCGGAGGC 
CTATGGA6GT CAAAACA6CG TG6AT66CGT 
CACTAAAC6A 6CTCT6CTTA TATAGACCTC 
CCATTTGCGT CAAT6G6GC6 GA6TT6TTAC 
TGATTTTGGT GCCAAAACAA ACTCCCATTG 
TGGAAATCCC CGTGAGTCAA ACCGCTATCC 
AAAACC6CAT CACCATG6TA ATAGC6AT6A 
CAA6TAG6AA AGTCCCATAA G6TCAT6TAC 



CGCAGCACCA 


TCACCGCGAG 


3650 


CA66TCAAAT 


TCAGACGGCA 


3700 


GCCC6TT6CA 


CCACA6ATGA 


3750 


CbCGTCTGGC 


CTTCCT6TAG 


3800 


GTAACAACCC 


GTCGGATTCT 


3850 


TGGGATAGGT 


TAC6TT6GTG 


3900 


CAGTTTGAGG 


GGACGAC6AC 


3950 


CCA6CTTTCC 


GGCACCGCTT 


4000 


CGCCATTCAG 


GCTGCGCAAC 


4050 


TCGCTATTAC 


GCCAGCTGGC 


4100 


TT6GGTAA0G 


CCAGG6TTTT 


4150 


CGCGCTT6AG 


CAGCTCCTT6 


4200 


GCAAC6AAAA 


TCAOGTTCTT 


4250 


TTTTTTGCTT 


TA6CAGGCTC 


4300 


ACAATTCCGC 


AGCTTTTAGA 


4350 


A6TAAAGGCA 


ACATCCACTG 


4400 


GGATCCGG6A 


CCT6AAATAA 


4450 


TTTCTGGTTT 


TTCAGTTCCT 


4500 


T6GATCGGTC 


CCGGTCTCTT 


4550 


CTCCAGGCGA 






CCACC6TACA 


CGCCTACC6C 


4650 


6ACATTTT6G 


AAAGTCCCGT 


4700 


ACGTCAAT66 


GGTGGAGACT 


4750 


ACGCCCATTG 


ATGTACTGCC 


4800 


CTAATACGTA 


GAT6TACTGC 


4850 


TG6GCATAAT 


GCCAGGC6GG 


4900 
CCATTTACCG TCATTGACGT CAATAGGGGG C6TACTT6GC ATATGATACA 4950

CTTGAT6TAC TGCCAAGT6G GCA6TTTACC GTAAATACTC CACCCATTGA 5000 

CGTCAATG6A AAGTCCCTAT TGGCGTTACT ATGGGAACAT AC6TCATTAT 5050 

TGAC6TCAAT GGGCGG6GGT CGTTGGGCGG TC^^GCCAGGC GGGCCATTTA 5100 

CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTA6AGGA TCTCCCTAGA 5150 

CAAATATTAC GCGCTAT6AG TAACACAAAA TTATTCAGAT TTCACTTCCT 5200 

CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA 5250 

AATTTACTAC AACATCCXXTC TAAAACC6CG OGAAAATTGT CACTTCCTGT 5300 

GTACACCGGC 6CACACCAAA AACGTCACTT TTGCCACATC CGTCGCTTAC 5350 

ATGTGTTCC6 CCACACTTGC AACATCACAC TTCCGCCACA CTACTAC6TC 5400 

ACCC6CCCC6 TTCCCACCCC CC6C6CCAC6 TCACAAACTTC CACCCCCTCA 5450 

TTATCATATT GGCTTCAATC CAAAATAAG6 TATATTATTG AT6ATGCTAG 5500 

CGAATTCATC 6AT6ATATCA 6ATCT6CC6G TCTCCCTATA GTGAGTC6TA 5550 

TTAATTTCGA TAAGCCAG6T TAACCTGCAT TAAT6AATCG GCCAACGCGC 5600 

G6GGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 5650 

ACTCGCTGOS CTCGGTCOTT CGGCTGCG6C GAGCGGTATC AGCTCACTCA 5700 

AAGGCG6TAA TACGGTTATC CACAGAATCA 6G66ATAACG CAGGAAA6AA 5750 

CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 5800 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT 5850 

CGACGCTCAA GTCAGAGGTG GCGAAACCC6 ACAGGACTAT AAAGATACCA 5900 

GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 5950 

CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 6000 

TCTCAAT6CT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC 6050 

CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC OSCTGCGCCT 6100 

TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG 6150 

CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG 6200 
CGGT6CTACA 6AGTTCTTGA AGTGGTGGCC
66ACA6TATT TGGTATCT6C 6CTCT6CT6A 
A6A6TT66TA 6CTCTT6ATC C66CAAACAA 
TTTTTTT6TT T6CAA6CA6C AGATTACGOG 
AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
TCAC6TTAAG G6ATTTTGGT CATGA6ATTA 
6ATCCTTTTA AATTAAAAAT 6AA6TTTTAA 
AGTAAACTTG GTCT6ACA6T TACCAATGCT 
TCAGCGATCT GTCTATTTCG TTCATCCATA 
GTAGATAACT ACGATAC66G AGGGCTTACC 
TGATACC6C6 AGACCCAC6C TCACOGGCTC 
CAGCCAGCCG 6AA6GGCC6A GC6CAGAAGT 
CTCCATCCA6 TCTATTAATT GTT6CCGGGA 
CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 
TCAC6CTCGT CGTTTGGTAT G6CTTCATTC 
AAGGCGAGTT ACATGATCCC CCATGTT6T6 
TCGGTCCTCC GATCGTTGTC A6AA6TAAGT 
ATG6TTATGG CA6CACT6CA TAATTCTCTT 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC 
GTATGC6GCG ACCGAGTT6C TCTTGCCCGG 
GCGCCACATA 6CAGAACTTT AAAAGTGCTC 
GGGGC6AAAA CTCTCAA66A TCTTACOGCT 
AACCCACTCG TGCACCCAAC TGATCTTCAG 
GTTTCT6GGT GAGCAAAAAC AGGAAGGCAA 
AAGGGCGACA C6GAAATGTT 6AATACTCAT 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA 



TAACTACGGC 


TACACTAGAA 


6250 


AGCCAGTTAC 


CTTCGGAAAA 


6300 


ACCACCGCTG 


CTAGCGGTGG 


6350 


CAGAAAAAAA 


G6ATCTCAA6 


6400 


ACGCTCAGTG 


GAACGAAAAC 


6450 


TCAAAAAG6A 


TCTTCACCTA 


6500 


ATCAATCTAA 


AGTATATAT6 


6550 


TAATCAGTGA 


GGCACCTATC 


6600 


GTTGCCTGAC 


TCCCC6TCGT 


6650 


ATCTGGCCCC 


AGTGCT6CAA 


6700 


CAGATTTATC 


AGCAATAAAC 


6750 


GGTCCTGCAA 


CTTTATCC6C 


6800 


AGCTAGAGTA 


AGTA6TTC6C 


6850 


TT6CTACA66 


CATCGTGGTG 


6900 


AGCTCCGGTT 


CCCAACGATC 


6950 


CAAAAAA6CG 


GTTAGCTCCT 


7000 


T6GC0GCA6T 


GTTATCACTC 


7050 


ACTGTCATGC 


CATCCGTAAG 


7100 


CAAGTCATTC 


TGAGAATAGT 


7150 


CGTCAATACG 


GvATAAX Awb 




ATCATT66AA 


AACGTTCTTC 


7250 


6TT6AGATCC 


AGTTC6ATGT 


7300 


CkTCTTTThC 


TTTCACCAGC 


7350 


AAT6CCGCAA 


AAAAGGGAAT 


7400 


ACTCTTCCTT 


TTTCAATATT 


7450 


TGAGCGGATA 


CATATTT6AA 


7500 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA 7550

A6T6CCACCT GAC6TCTAA6 AAACCATTAT TATCAT6ACA TTAACCTATA 7600 

AAAATAG6CG TATCAC6AGG CCCTTTCGTC TC6CGCGTTT C66T6AT6AC 7650 

G6TGAAAACC TCT6ACACAT GCAGCTCCCG 6AJA0GGTCA CAGCTT6TCT 7700 

GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGC6CG TCAGC6GGT6 7750 

TTG6CGGGTG TCG6GGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 7800 

CT6AGA6TGC ACCATATGGA CATATTGTCG TTAGAACGC6 GCTACAATTA 7850 

ATACATAACC TTATGTATCA TACACATAC6 ATTTA66T6A CACTATA 7897 
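
Each line of the sequence listing carries groups of ten bases followed by a running base count, so a line can be checked against the count on the line before it. The Python sketch below illustrates that check. The function names are hypothetical, and the mapping of the digit "6" to "G" is an assumption about how the scan mis-read the letter G, not something stated in the listing.

```python
def parse_listing_line(line):
    """Split 'GROUP GROUP ... N' into (bases, running_total)."""
    tokens = line.split()
    running_total = int(tokens[-1])
    # Assumption: a '6' inside the sequence is a mis-scanned 'G'
    bases = "".join(tokens[:-1]).upper().replace("6", "G")
    return bases, running_total


def running_total_is_consistent(line, previous_total):
    """True if previous_total plus the bases on this line equals its count."""
    bases, running_total = parse_listing_line(line)
    return previous_total + len(bases) == running_total


# Last line of SEQ ID NO:1 as scanned; the preceding line ends at 7850
last_line = "ATACATAACC TTATGTATCA TACACATAC6 ATTTA66T6A CACTATA 7897"
last_line_ok = running_total_is_consistent(last_line, 7850)
```

The final line holds 47 bases, which brings the count from 7850 to the stated sequence length of 7897 base pairs.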

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAATTCGCTA 6CTAGCGG6G GAATACATAC CCGCAGGCGT AGAGACAACA 50 

TTACAGCCCC CATAG6A66T ATAACAAAAT TAATA66A6A 6AAAAACACA 100 

TAAACACCTG AAAAACCCTC CT6CCTA66C AAAATA6CAC CCTCCCGCTC 150 

CAGAACAACA TACAGCGCTT CACAGCGGCA GCCTAACA6T CAGCCTTACC 200 

AGTAAAAAAG AAAACCTATT AAAAAAACAC CACTCGACAC GGCACCAGCT 250 

CAATCAGTCA CAGTGTAAAA AAGGGCCAAG TGCAGAGCGA GTATATATAG 300 

GACTAAAAAA TGACGTAACG GTTAAAGTCC ACAAAAAACA CCCAGAAAAC 350 

CGCACGCGAA CCTACGCCCA GAAACGAAAG CCAAAAAACC CACAACTTCC 400 

TCAAATCGTC ACTTCCGTTT TCCCACGTTA CGTAACTTCC CATTTTAAGA 450 

AAACTACAAT TCCCAACACA TACAA6TTAC TCCGCCCTAA AACCTACGTC 500 

ACCCGCCCCG TTCCCACGCC CCGC6CCACG TCACAAACTC CACCCCCTCA 550 
TTATCATATT G6CTTCAATC CAAAATAAGG
CATCATCAAT AATATACCTT ATTTTGGATT 
GG66T66A6T TT6T6ACX3T6 6C6CGGG6C6 
TA6TA6T6T6 6C66AAOT6T 6AT6TT6GAA 
6CGACG6AT6 TG6CAAAAGT GACGTTTTTG 
6AAGTGACAA TTTTCGCGCG 6TTTTAGGCG 
CGTAACCGAG TAA6ATTTG6 CCATTTT06C 
A6T6AAATCT (SAATAATTTT 6T6TTACTCA 
66GAGATCA6 CCTGCA6GTC GTTACATAAC 
66CT6ACCGC CCAACGACCC CC6CCCATT6 
TCCCATAGTA AC6CCAATAG 6GACTTTCCA 
ATTTAC66TA AACTGCCGAC TT6GCAGTAC 
AGTACGCCCC CTATTGAC6T CAAT6AC6GT 
T6CCCAGTAC AT6ACCTTAT GGGACTTTCC 
TATTA6TCAT CGCTATTACC ATGGT6ATGC 
GGGCGT6GAT AGCGGTTT6A CTCACGGGGA 
T6ACGTCAAT GGGAGTTTGT TTTGGCACCA 
AAT6TC6TAA CAACTCCGCC CCATT6AC6C 
CGGTGG6AGG TCTATATAAG CAGAGCTC6T 
CTGGAGACGC CATCCAC6CT GTTTT6ACCT 
6ATCCAGCCT CCG6ACTCTA GAGGATCXGG 
CAGAAA6TTA ACTGGTAAGT TTAGTCTTTT 
GGATCCGGT6 GTGGTGCAAA TCAAAGAACT 
TTTACTTCTA GGCCTGTACG GAAGTGTTAC 
AATTGTACCC GCGGCCGCAA TTCCCGGGGA 
AAAAAAGAAG TCACCAT6TC GTTTACTTT6 



TATATTATTG 


ATGATGCTAG 


600 


6AA6CCAATA 


TGATAAT6A6 


650 


TG66AA0GG6 


GCGGGTGACG 


700 


6*xGTG6CG6A 


ACACAT6TAA 


750 


6TGTGCGCCG 


GTGTACACAG 


800 


GATGTTGTAG 


TAAATTTGGG 


850 


GGGAAAACT6 


AATAAGAGGA 


900 


TAGCGCGTAA 


TATTT6TCTA 


950 


TTACGGTAAA 


TGGCCC6CCT 


1000 


ACGTCAATAA 


T6ACGTATGT 


1050 


TTGACGTCAA 


T6GGTG6AGT 


1100 


ATCAAGT6TA 


TCATATGCCA 


1150 


AAATGGCCCG 


CCTGGCATTA 


1200 


TACTTGGCAG 


TACATCTACG 


1250 


GGTTTTGGCA 


GTACATCAAT 


1300 


TTTCCAAGTC 


TCCACCCCAT 


1350 


AAATCAACGG 


GACTTTCCAA 


1400 


AAATGGGCGG 


TAGGCGTGTA 


1450 


TTAGTGAACC 


GTCAGATCGC 


1500 


CCATAGAAGA 


CACCGG6ACC 


1 CCA 

X99U 


TACTC6AGGA 


ACTGAAAAAC 


1600 


T6TCTTTTAT 


TTCA6GTCCC 


1650 


GCTCCTCAGT 


GGATGTTGCC 


1700 


TTCTGCTCTA 


AAA6CT6C6G 


1750 


TCGAAAGAGC 


CTGCTAAAGC 


1800 


ACCAACAAGA 


ACGTGATTTT 


1850 
CGTTGCCGGT CTGGGAGGCA TTGGTCTGGA CACCAGCAAG GAGCTGCTCA 1900

AGCGCGATCC CGTCGTTTTA CAACXSTCGTG ACTGGGAAAA CCCTGGCGTT 1950 

ACCCAACTTA ATCGCCTT6C AGCACATCCC CCTTTCGCCA GCTGGCGTAA 2000 

TAGCGAAGAG GCCCGCACCG ATCGCCCTTC CCAACAGTTG C6CAGCCTGA 2050 

ATG6C6AATG GCGCTTTGCC TGGTTTCCGG CACCAGAAGC GGTGCCG6AA 2100 

AGCTGGCTGG AGTGCGATCT TCCTGAG6CC GATACTGTCG TCGTCCCCTC 2150 

AAACTGGCAG ATGCACGGTT ACGATGOGCC CATCTACACC AACGTAACCT 2200 

ATCCCATTAC GGTCAATCCG CCGTTTGTTC CCACGGA6AA TCCGACGGGT 2250 

TGTTACTCGC TCACATTTAA T6TTGAT6AA AGCTG6CTAC AGGAAG6CCA 2300 

GACGCGAATT ATTTTTGATG GCGTTAACTC GGCGTTTCAT CTCTGGTGCA 2350 

ACGG6C6CT6 G6TCG6TTAC GGCCAGGACA GTCGTTTGCC GTCTGAATTT 2400 

GACCT6A606 CATTTTTACG OGCCGGAGAA AACC6CCTCG CGGTGAT66T 2450 

6CTGCGTTG6 A6TGAC6GCA GTTATCTGGA AGATCAG6AT ATGTGGCGGA 2500 

TGAGCGGCAT TTTCCGTGAC GTCTCGTTGC T6CATAAACC GACTACACAA 2550 

ATCAGCGATT TCCAT6TT6C CACTCGCTTT AATGATGATT TCAGCCGCGC 2600 

TGTACT66AG 6CT6AA6TTC AGATGTGC66 CGAGTTGCGT GACTACCTAC 2650 

GGGTAACA6T TTCTTTATGG CAGG6TGAAA OGCAGGTCGC CAGCGGCACC 2700 

GCGCCTTTCG GCGGT6AAAT TATCGAT6AG CGTGGTGGTT ATGCCGATCG 2750 

CGTCACACTA CGTCTGAACG TCGAAAACCC GAAACTGTGG AGCGCCGAAA 2800 

TCCCGAATCT CTATCGTGCG GTGGTTGAAC TGCACACCGC CGACGGCACG 2850 

CTGATTGAAG CAGAA6CCTG CGATGTCGGT TTCCGCGAGG TGCG6ATTGA 2900 
AAATGGTCTG CT6CT6CT6A ACGGCAAGCC GTTGCTGATT CGAGGCGTTA , 2950 

ACCGTCACGA GCATCATCCT CTGCATGGTC AGGTCATGGA TGAGCAGACC 3000 

ATGGTGCAGG ATATCCTGCT GATGAAGCAG AACAACTTTA ACGCCGTGCG 3050 

CTGTTCGCAT TATCCGAACC ATCCGCTGTG GTACACGCTG TGCGACCGCT 3100 

ACGGCCT6TA TGTGGTGGAT GAAGCCAATA TTGAAACCCA CGGCATGGTG 3150 
CCAAT6AATC GTCT6ACCGA TGATCC6CGC
ACGC6TAAC6 CGAAT6GTGC AGCGCGATC6 
TCTGCTCGCT GGGGAATGAA TCAGGCCACG 
TATCGCTG6A TCAAATCT6T C6ATCCTTCC 
CGGCGGAGCC GACACCACGG CCACCGATAT 
GCGTGGATGA AGACCAGCCC TTCCCGGCTG 
AAATGGCTTT CGCTACCTGG AGAGACGCGC 
06CCCACGC6 ATGGGTAACA 6TCTTGGCGG 
CGTTTCGTCA GTATCCCCGT TTACAGGGCG 
GATCAGTCGC TGATTAAATA TGATGAAAAC 
CGGCGGTGAT TTT6GCGATA CGCC6AACGA 
GTCTG6TCTT TGCC6ACCGC ACGC06CATC 
CACCAGCAGC AGTTTTTCCA GTTCCGTTTA 
GACCAGCGAA TACCTGTTCC GTCATA6CGA 
TGGTGGCGCr GGATGGTAA6 CCGCTGGCAA 
GTCGCTCCAC AAGGTAAACA GTTGATTGAA 
GGAGAGCGCC GGGGAACTCT GGCTCACAGT 
CGACCGCATG GTCAGAAGCC GGGCACATCA 
CTGGCGGAAA ACCTCAGTGT GACGCTCCCC 
GCATCTGACC ACCAGCGAAA TGGATTTTTG 
6TTGGCAATT TAACCGCCA6 TCAGGCTTTC 
GATAAAAAAC AACT6CTGAC GCCGCTGCGC 
GCTGGATAAC GACATTGGC6 TAAGTGAA6C 
CCTGGGTCGA ACGCTGGAAG GCGGC6GGCC 
TTGTT6CAGT GCACG6CAGA TACACTTGCT 
CGCTCACGCG TGGCA6CATC A6GGGAAAAC 



TG6CTAC06G 


CGATGAGC6A 


3200 


TAATCACCC6 


AGT6TGATCA 


3250 


6C6CTAATCA 


C6AC6CGCT6 


3300 


CuCCCGGTGC 


A6TAT6AA6G 


3350 


TATTT6CCC6 


ATGTACGCGC 


3400 


TGCCGAAATG 


6TCCATCAAA 


3450 


C06CTGATCC 


TTTGC6AATA 


3500 


TTTCGCTAAA 


TACTGGCA6G 


3550 


GCTTC6TCTG 


G6ACTGG6TG 


3600 


66CAACCCGT 66TCGGCTTA 


3650 


TOGCCAGTTC 


T6TAT6AACG 


3700 


CAGC6CTGAC 


G6AAGCAAAA 


3750 


TCC666CAAA 


CCATCGAAGT 


3800 


TAAC6A6CTC 


CTGCACTG6A 


3850 


6CG6T6AA6T 


6CCTCT66AT 


3900 


CTGCCT6AAC 


TACCGCA6CC 


3950 


A06C6TA6T6 


CAACC6AACG 


4000 


6C6CCTG6CA 


GCA6T66CGT 


4050 


6CCGCGTCCC 


ACGCCATCCC 


4100 


CATCGAGCT6 


GGTAATAAGC 


4150 


TTTCACA6AT 


GTGGATTG6C 


4200 


6ATCAGTTCA 


CCC6T6CACC 


4250 


GACCCGCATT 


GACCCTAACG 


4300 


ATTACCAGGC 


CGAA6CAGCG 


4350 


6ATGCG6TGC 


TGATTAC6AC 


4400 


CTTATTTATC 


AGCCG6AAAA 


4450 
CCTACCGGAT TGATGGTAGT 6GTCAAATGG
6TG6C6AGCG ATACACCGCA TCCGGCGCGG 
6GC6CAGGTA GCAGA6C6GG TAAACTGGCT 
ACTATCCC6A CCGCCTTACT GCCGCCTGTT 
TTGTCAGACA TGTATACCCC GTACGTCTTC 
CTGCGGGACG CGCGAATTGA ATTATGGCCC 
TCCAGTTCAA CATCAGCCGC TACAGTCAAC 
CATCGCCATC TGCTGCACGC GGAAGAAGGC 
TTTCCATATG GG6ATTGGTG GC6AC6ACTC 
CGGAATTACA GCTGAGCGCC GGTCGCTACC 
CAAAAATAAT AATAACCGGG CAGGCCATGT 
GAAATCCATT ATGTACTATT TAAAAAACAC 
TATTCTTTTT CTTTTACTTT TTTATCATGG 
CCGATTTGGC TACATGACAT CAACCATATC 
TATTTTTGCC GCTATTTCTC TGTTCTCGCT 
GTCTGCTTTC TGACAAACTC GGCCTCGACT 
AGACATGATA AGATACATTG ATGAGTTTGG 
AGTGAAAAAA ATGCTTTATT TGT6AAATTT 
GTAACCATTA TAAGCTGCAA TAAACAAGTT 
TTTTATGTTT CAGGTTCAGG G6GA66TGTG 
TAGAGTCGAC GACGOGAGGC TGGATGGCCT 
GCTTCCGGCG GCATCGGGAT GCCCGCGTTG 
GGTAGATGAC GACCATCAGG GACAGCTTCA 
CCAGCCTAAC TTCGATCACT GGACCGCTGA 
GCCTCGGCGA GCACATGGAA CGGGTTGGCA 
ATACCTTGTC TGCCTCCCCG CGTTGCGTCG 



CGATTACCGT 


TGATGTTGAA 


4500 


ATTGGCCTGA ACT6CCAGCT 


4550 


CX3GATTAGGG 


CCGCAA6AAA 


4600 


TTUACCGCTG 


GGATCTGCCA 


4650 


CCGAGCGAAA 


ACGGTCTGCG 


4700 


ACACCAGTGG 


CGCGGCGACT 


4750 


A6CAACTGAT 


G6AAACCAGC 


4800 


ACATGGCTGA ATATCGA06G 


4850 


CTGGAGCCCG 


TCAGTATCGG 


4900 


ATTACCAGTT 


GGTCTGGT6T 


4950 


CTGCCCGTAT 


TTCGCGTAAG 


5000 


AAACTTTTGG 


ATGTTCGGTT 


5050 


GAGCCTACTT 


CCCGTTTTTC 


5100 


AGCAAAAGTG 


ATACGGGTAT 


5150 


ATTATTCCAA 


CCGCTGTTTG 


5200 


CTAG6CGGCC 


GCGGGGATCC 


5250 


ACAAACCACA ACTAGAATGC 


5300 


GTGATGCTAT 


TGCTTTATTT 


5350 


AACAACAACA 


ATTGCATTCA 


5400 


GGAGGTTTTT 


TCGGATCCTC 




TCCCCATTAT 


GATTCTTCTC 


5500 


CAGGCCATGC 


TGTCCAGGCA 


5550 


AGGATCGCTC GCGGCTCTTA 


5600 


TCGTCACGGC 


GATTTATGCC 


5650 


TGGATTGTAG 


GCGCCGCCCT 


5700 


CGGTGCATGG 


AGCCGGGCCA 


5750 
CCTC6ACCT6 AATG6AA6CC GGC66CACCT
CAAGAATT66 A6CCAATCAA TTCTT6CG6& 
AACCCTTGGC AGAACATATC CATC606TCC 
6C6GC6CATC TCGGGCAGCG TTGG6TCCT6 
TGCTCCT6TC GTT6AG6ACC CGGCTAGCCT 
TAGCAGAATG AATCACC6AT AC6C6A6C6A 
CAAAAC6TCT GC6ACCT6A6 CAACAACAT6 
TTT0GTAAA6 TCT6GAAAC6 C66AA6TCAG 
CGGATCTGCA TCGCAGGATG CTGCTGGCTA 
T6TATTAACG AAGCCTTTCT CAATGCTCAC 
6T6TA66TC6 TTCGCTCCAA 6CTGGGCTGT 
6CCC6A00GC T6CGCCTTAT CC66TAACTA 
TAA6ACAC6A CTTATC6CCA CTGGCA6CA6 
AGAGCGAGGT ATGTAGGCGG TGCTACAGAG 
CTACGGCTAC ACTAGAAGGA CA6TATTT66 
CAGTTACCTT C66AAAAAGA 6TT6GTA6CT 
ACCGCT66TA GCG6TG6TTT TTTT6TTT6C 
AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT 
CTCAGTGGAA CGAAAACTCA CGTTAAGGGA 
AAAAG6ATCT TCACCTAGAT CCTTTTAAAT 
AATCTAAAGT ATATAT6AGT AAACTTG6TC 
TCA6T6AG6C ACCTATCTCA GCGATCT6TC 
GCCTGACTCC CCGTCGTGTA GATAACTACG 
TGGCCCCAGT GCTGCAAT6A TACCGCGA6A 
ATTTATCAGC AATAAACCAG CCA6C0GGAA 
CCTGCAACTT TATCCGCCTC CATCCAGTCT 



CGCTAAC6GA 


TTCACCACTC 


5800 


6AACT6TGAA 


T6C6CAAACC 


5850 


6CCATCTCCA 


6CA6CC6CAC 


5900 


G0CAC6GGT6 


C6CAT6ATC6 


5950 


G6CG6G6TTG 


CCTTACT6GT 


6000 


AC6T6AA6C6 


ACT6CT6CT6 


6050 


AAT6GTCTTC 


GGTTTCCGTG 


6100 


CGCCCTGCAC 


CATfTATGTTC 


6150 


CCCTGTGGAA 


CACCTACATC 


6200 


6CTGTA6GTA 


TCTCAGTTCG 


6250 


GTGCAC6AAC 


CCCCCGTTCA 


6300 


TCGTCITGAC 


TCCAACCCGG 


6350 


CCACTGGTAA 


CAGGATTAGC 


6400 


TTCTTGAAGT 


66TGGCCTAA 


6450 


TATCTCOGCT 


CTGCTGAA6C 


6500 


CTTGATCCGG 


CAAACAAACC 


6550 


AAGCAGCAGA 


TTACGCGCAG 


6600 


CTTTTCTACG


GGGTCT6ACG 


6650 


TTTTGGTCAT 


GAGATTATCA 


6700 


TAAAAATGAA 


6TTTTAAATC 


6750 


T6ACAGTTAC 


CAATGCTTAA 


6800 


TATTTCGTTC 


ATCCATA6TT 


6850 


ATACGGGAGG 


GCTTACCATC 


6900 


CCCACGCTCA 


CCGGCTCCAG 


6950 


6GGCCGAGCX3 


CA6AAGTGGT 


7000 


ATTAATT6TT 


GCCGGGAAGC 


7050 
TAGAGTAAGT AGTTCGCCAG TTAATAGTTT
CTGCA66CAT CGTGGTGTCA CGCTCGTCGT 
TCCG6TTCCC AAC6ATCAAG GC6A6TTACA 
AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT 
CCGCAGTGTT ATCACTCATG GTTATGCCAG 
GTCAT6CCAT CCGTAA6ATG CTTTTCT6TG 
GTCATTCT6A 6AATA6TGTA T6CGGCGACC 
CAACAC6GGA TAATACCGC6 CCACATA6CA 
ATTGGAAAAC GTTCTTCGGG GCGAAAACTC 
GAGATCCAGT TCGATGTAAC CCACTCGTGC 
CTTTTACTTT CACCAGCGTT TCTGGGT6AG 
GCCGCAAAAA AGGGAATAAG GGCGACACGG 
CTTCCTTTTT CAATATTATT GAAGCATTTA 
GCGGATACAT ATTTGAATGT ATTTAGAAAA 
CGCACATTTC CCCGAAAAGT GCCACCTGAC 
CATGAGATTA ACCTATAAAA ATAGGCGTAT 
AA 



GCGCAACGTT GTTGCCATTG 


7100 


TTGGTATGGC TTCATTCA6C 


7150 


TCATCCCCCA TGTTGTGCAA 


7200 


CGITGTCAGA AGTAAGTTGG 


7250 


CACTGCATAA TTCTCTTACT 


7300 


ACTGGTGAGT ACTCAACCAA 


7350 


6AGTTGCTCT TGCCOGGCGT 


7400 


CAACTTTAAA AGTGCTCATC 


7450 


TCAAGGATCT TACCGCTGTT 


7500 


ACCCAACTGA TCTTCAGCAT 


7550 


CAAAAACAGG AAGGCAAAAT 


7600 


AAATGTTGAA TACTCATACT 


7650 


TCA6GGTTAT TGTCTCAT6A 


7700 


ATAAACAAAT AGGGGTTCCG 


7750 


GTCTAAGAAA CGATTATTAT 


7800 


CACGAGGCCC TTTCGTCTTC 


7850 




7852 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 
GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT 
CAGGGGATAA CGCAGGAAAG AACATGTGAG
A66AAC06TA AAAA66CC6C 6TT6CT6606 
CCCT6AC6A6 CATCACAAAA ATC6A06CTC 
C6ACA66ACT ATAAA6ATAC CA66C6TTTC 
C6CTCTCCTG TTCCGACCCT GCCGCTTACC 
CCCTTCGG6A AG0GTG6CGC TTTCTGATAG 
GTTCGGTGTA G6TCGTTCGC TCCAA6CT66 
6TTCA6CCCG ACC6CTGCGC CTTATCCG6T 
CCC6GTAA6A CAC6ACTTAT C6CCACT66C 
TTAGCAGAGC GAGGTATGTA 66C6GT6CTA 
CCTAACTAC6 GCTACACTAG AA6AACA6TA 
GAA6CCA6TT ACCTTCG6AA AAA6AGTTGG 
AAACCACCGC T66TAGC6GT G6TTTTTTT6 
CGCAGAAAAA AA66ATCTCA AGAAGATCCT 
TGACGCTCAG TGGAACGAAA ACTCACGTTA 
TATCAAAAAG 6ATCTTCACC TA6ATCCTTT 
AAATCAATCT AAAGTATATA T6AGTAAACT 
CTTAATCAGT GAGGCACCTA TCTCAGCGAT 
TAGTTGCCTG ACTCCCCGTC GTGTAGATAA 
CCATCTGGCC CCAGTGCTGC AATGATACCG 
TCCAGATTTA TCAGCAATAA ACCAGCCA6C 
6TG6TCCT6C AACTTTATCC GCCTCCATCC 
6AA6CTA6A6 TAA6TA6TTC GCCA6TTAAT 
CATTGCTACA GGCATCGTGG TGTCACXXrrC 
TCAGCTCCGC TTCCCAAC6A TCAAGGCGAG 
TGCAAAAAA6 C66TTA6CTC CTTCGGTCCT 



CAAAA66CCA 


GCAAAAGGCC 


150 


TTTTTCCATA 


GGCTCCGCCC 


200 


AAGTCA6A66 


T6GC6AAACC 


250 


CCCCT6GAA6 


CTCCCTCGTG 


300 


GGATACCTGT 


CCGCCTTTCT 


350 


CTCAC6CT6T 


AG6TATCTCA 


400 


6CT6T6T6CA 


CGAACCCCCC 


450 


AACTATC6TC 


TTGAGTCCAA 


500 


A6CAGCCACT 


GGTAACAGGA 


550 


CA6A6TTCTT 


GAAGTGGTGG 


600 


TTTGGTATCT 


GCGCTCTGCT 


650 


TA6CTCTT6A 


TCC66CAAAC 


700 


TTT6CAA6GA 


GCAGATTACG 


750 


TTGATCTTTT 


CTACGGGGTC 


800 


AGGGATTTTG 


GTCATGAGAT 


850 


TAAATTAAAA 


ATGAAGTTTT 


900 


TGGTCTGACA 


GTTACCAATG 


950 


CT6TCTATTT 


CGTTCATCCA 


1000 


CTACGATACG 


GGAGGGCTTA 


1050 



CCAGACCCAC 


GCTCACCGGC 


1100 


CGGAAGGGCC 


GAGCGCAGAA 


1150 


AGTCTATTAA 


TTGTTGCCGG 


1200 


AGTTTGCGCA 


ACGTT6TT6C 


1250 


GTCGTTTGGT 


ATGGCTTCAT 


1300 


TTACATGATC 


CCCCATGTTG 


1350 


CCGATCGTTG 


TCA6AAGTAA 


1400 
GTTGGCCGCA GTGTTATCAC TCATGGTTAT
TTACTGTCAT GCCATCOGTA AGATGCTTTT 
ACCAA6TCAT TCTGAGAATA 6TGTATGC6G 
GGCGTCAATA C6GGATAATA CCGCGCGACA 
TCATCATTGG AAAACGTTCT TCGGGGCGAA 
CTGTTGAGAT CCAGTTCGAT GTAACCCACT 
AGCATCTTTT ACTTTGACCA GC6TTTCTG6 
AAAATGCCGC AAAAAAGGGA ATAAGGGCGA 
ATACTCTTCC TTTTTCAATA TTATTGAAGC 
CATGAGCGGA TACATATTTG AATGTATTTA 
TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC 
ATTATCATGA CATTAACCTA TAAAAATAGG 
TCTCGCGC6T TTCGGTGATG ACGGTGAAAA 
CGGAGACGGT CACAGCTTGT CTGTAAGCG6 
CGTCAGGGCG CGTCAGCGGG TGTTG6CGGG 
TGCGGCATCA GAGCA6ATTG TACTGAGAGT 
TTAATATTTT GTTAAAATTC GCGTTAAATT 
TTTAACCAAT AGGCC6AAAT CGGCAAAATC 
GCCCGAGATA GGGTTGAGTG TTGTTCCAGT 
TAAAGAACGT GGACTCCAAC GTCAAAGGGC 
GATG6CCCAC TACGTGAACC ATCACCCAAA 
GTGCCGTAAA GCACTAAATC GGAACCCTAA 
CTTGACGGG6 AAAGCCGGC6 AACGTG606A 
AAAGGAGCGG GCGCTA6GGC GCTGGCAAGT 
AACCACGACA CCCGCCGCGC TTAATGCGCC 
GTTGCTTTGA C6TATGCGGT GTGAAATACC 



GGCAGCACTG 


CATAATTCTC 


1450 


CT6TGACTG6 


TGAGTACTCA 


1500 


C6ACC6A6TT 


6CTCTTGCCC 


1550 


TAGCA6AACT 


TTAAAA6T6C 


1600 


AACTCTCAAG 


6ATCTTACC6 


1650 


C6T6CACCCA 


ACTGATCTTC 


1700 


6T6A6CAAAA ACAGGAAG6C 


1750 


CACGGAAAT6 


TTGAATACTC 


1800 


ATTTATCAG6 


GTTATTGTCT 


1850 


6AAAAATAAA 


CAAATAGG6G 


1900 


CTGACGTCTA AGAAACCATT 


1950 


CGTATCAC6A 


GGCCCTTTCG 


2000 


CCTCTGACAC 


AT6CA6CTCC 


2050 


ATGCC6GGAG 


CAGACAAGCC 


2100 


TGTCGGGGCT 


GGCTTAACTA 


2150 


GCACCATAAA ATTGTAAACG 


2200 


TTT6TTAAAT 


CAGCTCATTT 


2250 


CCTTATAAAT 


CAAAAGAATA 


2300 


TTG6AACAAG 


AGTCCACTAT 


2350 


GAAAAACC6T 


CTATCAGGGC 


2400 


TCAAGTTTTT 


TGGGGTCGAG 


2450 


AGGGAGCCCC 


0GATTTAGA6 


2500 


GAAA6GAAG6 


GAA6AAAGCG 


2550 


GTAGCGGTCA 


C6CTGCGCGT 


2600 


GCTACAGGGC 


GCGTACTATG 


2650 


6CACAGATGC 


GTAAGGA6AA 


2700 
AATACCGCAT


CAGGCGCCAT 


TCGCCATTCA 


GGCTGCGCAA 


CTGTTGGGAA 


2750 


6G6C6ATC66 


T6CG66CCTC 


TTCGCTATTA 


C6CCAGCT6G 


C6AAAGGGG6 


2800 


AT6T6CT6CA 


A6GC6ATTAA 


GTTGG6TAAC 


6CCAGG6TTT 


TCCCA6TCAC 


2850 


6AC6TT6TAA 


AAC6AC66CC 


AGT6CCAA6C 


1\l'AA6GT6CA 


CG6CCCA0GT 


2900 


GGCCACTA6T 


ACTTCTCGA6 


CTCTGTACAT 


GTCCGCGGTC 


GCGAC6TAC6 


2950 


CGTATC6ATG 


GC6CCA6CT6 


CAGGCGGCCG 


CCATATGCAT 


CCTA6GCCTA 


3000 


TTAATATTCC 


G6AGTATACG 


TA6CC6GCTA 


ACGTTAACAA 


CCGGTACCTC 


3050 


TA6AACTATA 


GCTAGCCAAT 


TCCATCATCA 


ATAATATACC 


TTATTTT6GA 


3100 


TTGAA6CCAA 


TAT6ATAAT6 


A6GGGGT6GA 


GTTT6TGACG 


T66C6C666G 


3150 


C6TG66AAC6 


GGGCGG6TGA 


CGTAGGTTTT 


AGGGCGGAGT 


AACTTGTAT6 


3200 


TGTTGGGAAT 


TGTA6TTTTC 


TTAAAATGGG 


AAGTTACGTA 


AC6TGG6AAA 


3250 


A06GAA6T6A 


C6ATTT6AGG 


AA6TTGTGG6 


TTTTTT66CT 


TTC6TTTCTC 


3300 


6GC6TA66TT 


C6CGT6C6GT 


TTTCTG6GT6 


TTTTTT6T66 


ACTTTAACC6 


3350 


TTACGTCATT 


TTTTAGTCCT 


ATATATACTC 


GCTCTGCACT 


TGGCCCTTTT 


3400 


TTACACT6T6 


ACT6ATT6AG 


CTGGTGCCGT 


GTCGAGTGGT 


6TTTTTTTAA 


3450 


TA66TTTTCT 


TTTTTACTGG 


TAAGGCTGAC 


TGTTAGGCTG 


C06CT6TGAA 


3500 


6C6CT6TAT6 


TTGTTCTGGA 


GCG66A666T 


6CTATTTTGC 


CTAGGCAGGA 


3550 


GGGTTTTTCA 


G6T6TTTAT6 


TGTTTTTCTC 


TCCTATTAAT 


TTT6TTATAC 


3600 


CTCCTATGGG 


GGCTGTAATG 


TTGTCTCTAC 


GCCTGCGGGT 


ATGTATTCCC 


3650 








AUtAwwA A w W V 


AAAAAACCTC


3700 


CCACACCTCC 


CCCT6AACCT 


GAAACATAAA 


AT6AAT6CAA 


TT6TT6TT6T 


3750 


TAACTT6TTT 


ATT6CA6CTT 


ATAATG6TTA 


CAAATAAA6C 


AATAGCATCA 


3800 


CAAATTTCAC 


AAATAAAGCA 


TTTTTTTCAC 


TGCATTCTA6 


TT6TG6TTTG 


3850 


TCCAAACTCA 


TCAATGTATC 


TTATCATGTC 


TGGATCCCCC 


TAGCTTGCCA 


3900 


AACCTACA66 


TG6G6TCTTT 


CATTCCCCCC 


TTTTTCT66A 


GACTAAATAA 


3950 


AATCTTTTAT 


TTTATCTATG 


6CTCGTACTC 


TATA66CTTC 


A6CT66T6AT 


4000 
ATTGTTGAGT CAAAACTAGA GCCTGGACCA CTGATATCCT GTCTTTAACA 4050

AATTGGACTA AT0GCG66AT CAGCCAATTC CATGAGCAAA TGTCCCATGT 4100 

CAACATTTAT GCTGCTCTCT AAA6CCTT6T ATCTTGCATC TCTTCTTCTG 4150 

TCTCCTCTTT CA6A6CA6CA ATCTGGGGCT TAiSACTTGCA CTTGCTTGAG 4200 

TTCCGGTGGG GAAAGAGCTT CACCCTGTCG GA6GGGCT6A TGGCTTGCCG 4250 

6AA6A6GCTC CTCTC6TTCA GCAGTTTCTG GATG6AATCG TACTGCCGCA 4300 

CTTTGTTCTC TTCTAT6ACC AAAAATT6TT GGCATTCCA6 CATT6CTTCT 4350 

ATCCTGT6TT CACA6A6AAT TACTGTGCAA TCA6CAAAT6 CTTGTTTTAG 4400 

AGTTCTTCTA ATTATTTG6T ATGTTACTGG ATCCAAAT6A 6CACT66GTT 4450 

CATCAA6CA6 CAA6ATCTTC 6CCTTACTGA GAACAGATCT A6CCAAGCAC 4500 

ATCAACT6CT TGTGGCCATG 6CTTAGGACA CAGCCCCCAT CCACAA6GAC 4550 

AAA6TCAA6C TTCCCAGGAA ACTGTTCTAT CACAGATCTG AGCCCAACCT 4600 

CATCT6CAAC TTTCCATATT TCTTGATCAC TCCACTGTTC ATAGG6ATCC 4650 

AAGTTTTTTC TAAATGTTCC AGAAAAAATA AATACTTTCT 6TGGTATCAC 4700 

TCCAAA6GCT TTCCTCCACT GTTGCAAAGT TATTGAATCC CAAGACACAC 4750 

CATC6ATCTG GATTTCTCCT TCAGT6TTCA 6TAGTCTCAA AAAAGCT6AT 4800 

AACAAAGTAC TCTTCCCTGA TCCAGTTCTT CCCAA6AGGC CCACCCTCT6 4850 

GCCAG6ACTT ATT6A6AA66 AAATGTTCrrC TAATATGGCA TTTCCACCTT 4900 

CTGTGTATTT TGCT6TGAGA TCTTTGACAG TCATTTG6CC CCCTGAGGGC 4950 

CA6AT6TCAT CTTTCTTCAC GTGTGAATTC TCAATAATCA TAACTTTCGA 5000 

GAGTTG6CCA TTCTTGTATG GTTTG6TTGA CTTGGTAGGT TTACCTTCTG 5050 

TTGGCATGTC AAT6AACTTA AAGACTC6GC TCACA6ATCG CATCAAGCTA 5100 

TCCACATCTA TGCTGGAGTT TACAGCCCAC T6CAATGTAC TCATGATATT 5150 

CATGGCTAAA GTCAGGATAA TACCAACTC?r TCCTTCTCCT TCTCCTGTTG 5200 

TTAAAATG6A AATGAAGGTA ACAGCAATGA AGAAGATGAC AAAAATCATT 5250 

TCTATTCTCA TTTGGAACCA GCGCAGTGTT GACA6GTACA A6AACCAGTT 5300 
GGCAGTATGT


AAATTCAGA6 


CTTTGT66AA CAGAGTTTCA AAGTAAGGCT 


5350 


GCCGTCCGAA 


GGCACGAAGT 


GTCCATAGTC CTTTTAAGCT TGTAACAAGA 


5400 


TGA6T6AAAA 


TTGGACTCCT 


GCCTTCAGAT TCCA6TTGTT TGA6TT6CT6 


5450 


TGA66TTT6G 


AGGAAATATG 


CTCTCAACAT AATAAAAGCC ACTATCACT6 


5500 


GCACTGTTGC 


AACAAA6ATG TAGGGTTGTA AAACT6CGAC AACTGCTATA 


5550 


6CTCCAATCA 


CAATTAATAA 


CAACTGGATG AAGTCAAATA TGGTAAGAGG 


5600 


CA6AA66TCA 


TCCAAAATTG 


CTATATCTTT GGAGAATCTA TTAAGAATCC 


5650 


CACCTGCTTT 


CAACGTGTTG AGGGTT6AGA TAGGTGCTTG AA6AACA6AA 


5700 


T6TAACATTT 


TGTGGT6TAA AATTTTC6AC ACTGT6ATTA GAGTATGCAC 


5750 


GAGTG6TAGA 


CCTCTGAAGA 


ATCCCATAGC AAGCAAAGTG TCGGCTACTC 


5800 


CCACGTAAAT 


GTAAAACACA 


TAATACGAAC TGGTGCTGGT GATAATCACT 


5850 


GCATAGCT6T 


TATTTCTACT 


ATGAGTACTA TTCCCTTTGT CTTGAAGA6G 


5900 


AGTGTTTCCA 


AGGAGCCACA 


GCACAACCAA AGAA6CAGCC ACCTCTGCCA 


5950 


GAAAAATTAC 


TAAGCACCAA ATTAGCACAA AAATTAAGCT CTT6TGGACA 


6000 


GTAATATATC 


GAAGGTATGT 


GTTCCATGTA GTCACTGCTG GTATGCTCTC 


6050 


CATATCATCA 


AAAAAGCACT 


CCTTTAAGTC TTCTTOGTTA ATTTCTTCAC 


6100 


TTATTTCCAA 


GCCAGTTTCT 


TGAGATAACC TTCTTGAATA TATATCCAGT 


6150 


TCA6TCAAGT 


TTGCCTGAGG 


GGCGAGT6AC ACTTTTCGTG TGGATGCTGT 


6200 


TGTCTTTCGG 


TGAATGTTCT 


GACCTTGGTT AACTGAGTGT GTCATCAG6T 


6250 


TCAGGACAGA 


CTGCCTCCTT 


CGTGCCTGAA GCGTGGGGCC AGTGCTGATC 


6300 


ACGCTGATGC 


GAGGCAGTAT 


CGCCTCTCCC TGCTCAGAAT CTGGTACTAA 


6350 


GGACAGCCTT 


CTCTCTAAAG 


GCTCATCAGA ATCCTCTTCG ATGCGATTCA 


6400 


TTTGTAAGG6 


AGTCTTTTGC ACAATGGAAA ATTTTC6TAT AGAGTTGATT 


6450 


GGATTGAGAA 


TAGAATTCTT 


CCTTTTTTCC CCAAACTCTC CAGTCTGTTT 


6500 


AAAAGATTGT 


TTTTTTGTTT 


CTGTCCAGGA GACAGGAGCA TCTCCTTCTA 


6550 


ATGAGAAACG 


GTGTAAGGTC 


TCAGTTAGGA TTGAATTTCT TCTTTCTGCA 


6600 
CTAAATTGGT CGAAAGAATC ACATCCCATG AGTTTTGAGC TAAAGTCTGG 6650

CTGTAGATTT TGGAGTTCTG AAAATGTCCC ATAAAAATAG CTGCTACCTT 6700 

CATGCAAAAT TAATATTTTG TCa^GCTTTCT TTAAATGTTC CATTTTA6AA 6750 

6T6ACCAAAA TCCTA6TTTT GTTAGCCATC AGTTTACAGA CACA6CTTTC 6800 

AAATATTTCT TTTTCTGTTA AAACATCTAG GTATCCAAAA G6A6AGTCTA 6850 

ATAAATACAA ATCAGCATCT TTGTATACTG CTCTTGCTAA AGAAATTCTT 6900 

GCTOGTTGAC CTCCACTCAG TGTGATTCCA CCTTCTCCAA GAACTATATT 6950 

GTCTTTCTCT GCAAACTTGG AGATGTCCTC TTCTAGTTGG CATGCTTTGA 7000 

TGACGCTTCT GTATCTATAT TCATCATAGG AAACACCAAA GATGATATTT 7050 

TCTTTAATGG TGCCAGGCAT AATCCAGGAA AACT6AGAAC AGAATGAAAT 7100 

TCTTCCACTG TGCTTAATTT TACCCTCTGA AGGCTCCAGT TCTCCCATAA 7150 

TCATCATTAG AAGTGAAGTC TTGCCTGCTC CAGTGGATCC AGCAACCGCC 7200 

AACAACTGTC CTCTTTCTAT CTTGAAATTA ATATCTTTCA GGACAGGAGT 7250 

ACCAAGAAGT GAGAAATTAC TGAAGAAGA6 GCTGTCATCA CCATTAGAAG 7300 

TTTTTCTATT GTTATTGTTT TGTTTTGCTT TCTCAAATAA TTCCCCAAAT 7350 

CCCTCCTCCC AGAAGGCTGT TACATTCTCC ATCACTACTT CTGTAGTCGT 7400 

TAA6TTATAT TCCAATGTCT TATATTCTTG CTTTTGTAAG AAATCCTGTA 7450 

TTTTGTTTAT TGCTCCAAGA GAGTCATACC ATGTTTGTAC AGCCCAGGGA 7500 

AATTGCCGAG T6ACC6CCAT GCGCAGAACA ATGCA6AATG AGATGGTG6T 7550 

GAATATTTTC CGGAGGAT6A TTCCTTTGAT TAGTGCATAG GGAAGCACAG 7600 

ATAAAAACAC CACAAAGAAC CCTGAGAAGA AGAAGGCTGA GCTATTGAAG 7650 

TATCTCACAT AGGCTGCCTT CCGAGTCAGT TTCAGTTCTG TTTGTCTTAA 7700 

GTTTTCAATC ATTTT T TCCA TTGCTTCTTC CCAGCAGTAT GCCTTAACAG 7750 

ATTGGATGTT CTCGATCATT TCTGAGGTAA TCACAAGTCT TTCACTGATC 7800 

TTCCCAGCTC TCTGATCTCT GTACTTCATC ATCATTCTCC CTAGCCCAGC 7850 

CTGAAAAAGG GCAAGGACTA TCAGGAAACC AAGTCCACAG AAGGCAGACG 7900 
CCTGTAACAA CTCCCAGATT AGCCCCATGA
GCGATCCACA CGAAATGTGC CAATGCAAGT 
GTT6TTGGAA AGGA6ACTAA CAAGTTGTCC 
CACGGCTT6A CAGCTTTAAA GTCTTCTTAT 
ATTCTCATCT 6CATTCCAAT 6T6ATGAAG6 
6A6CAGT6TC CTCACAATAA A6A6AAG6CA 
TCGCGATAGA GCGTTCCTCC TTGTTATCCG 
CTTCCCAGTA A6A6AG6CTG TACTGCTTTG 
AAAGATTCCA TAGAACATAA ATCTCCAGAA 
TAAT6AGTTT AGGATTTTTC TTT6AAGCCA 
TCCAATTTTT CAGATAGATT GTCAGCA6AA 
TATGTCTGAC AATTCCAGGC GCTGTCTGTA 
TG6TCCA6CT 6AAAAAAAGT TTGGAGACAA 
GACCTCTGCA TGGTCTCTCG GIGCGCTGGGG 
GCTCAAGCTC CTAATGCCAA AGGAATTCCT 
GTTCTAGAGC GGCCGCCACC GCGGTGGCTG 
GCGCTTCGCT T7TTATAGGG CCGCCGCCGC 
AAACTTTCG6 AGC6C6CCGC TCT6ATTGGC 
TC6CCCCGCC COTCCCCTCG CCCCGCCCC6 
CCCCCCCCCC CC6CCCCCAT CGCT6CACAA 
ATACAAAATT GGGGGTGGGG AGGGGGGGGA 
ACGTGGCCTC GAGTAGATGT ACTGCCAAGT 
T6TACTGGGC ATAATGCCAG GCGGGCCATT 
GGGG6CGTAC TTG6CATATG ATACACTTGA 
TTACCGTAAA TACTCCACCC ATTGACGTCA 
TTACTATGGG AACATACGTC ATTATTGACG 



G6AGTGCCAC 


TTGCAAAG6A 


7950 


CCTTCATCAA 


ATTTGTTCAG 


8000 


AATACTTATT 


TTATCTAGAA 


8050 


AAATCAAACT 


AAACATA6CT 


8100 


CCAAAAAT6G 


CTGGGT6TA6 


8150 


TAA6CCTAT6 


CCTAGATAAA 


8200 


G6TCATAGGA 


AGCTATGATT 


8250 


GT6ACTTCCC 


CTAAATATAA 


8300 


AAAACATCGC 


CGAAGG6CAT 


8350 


GCTCTCTATC 


CCATTCTCTT 


8400 


TCAACAGAAG 


GGATTTGGTA 


8450 


TCCTTTCCTC 


AAAATTGGTC 


8500 


CGCTGGCCTT 


TTCCA6AGGC 


8550 


TCCCTGCTAG 


G6CCGTCTGG 


8600 


6CAGCCCGGG 


GGATCCACTA 


8650 


ATCCCGCTCC 


CGCCCGCCGC 


8700 


CGCCGCCTCG 


CCATAAAAGG 


8750 


TGCC6CCGCA 


CCTCTCC6CC 


8800 


CCCCGCCTGG 


C6CGC6CCCC 


8850 


AATAATTAAA 


AAATAAATAA 


8900 


GATGG6GAGA 


6TGAAGCAGA 


8950 


AGGAAAGTCC 


CATAAGGTCA 


9000 


TACCGTCATT 


GACGTCAATA 


9050 


TGTACTGCCA 


AGTGGGCAGT 


9100 


ATGGAAAGTC 


CCTATTGGCG 


9150 


TCAATGGGCG 


6GG6TCGTTG 


9200 
GGCGGTCAGC CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 9250

GCTGATCTCC CTTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 9300 

CA6ATTTCAC TTCCTCTTAT TCAGTTTTCC C6C6AAAATG GCCAAATCTT 9350 

ACTCG6TTAC GCCCAAATTT ACTACAACAT CCJCCTAAAA CCGCGCGAAA 9400 

ATTGTCACTT CCTGTGTACA CCGGCGCACA CCAAAAACGT CACTTTTGCC 9450 

ACATCCGTCG CTTTACATGTG TTCCGCCACA CTTGCAACAT CACACTTCCG 9500 

CCACACTACT ACGTCACCC6 CCCCGTTCCC AC6CCCC6CG CCACGTCACA 9550 

AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA TAAGGTATAT 9600 

TATTGATGAT GCTA6CATGC GCAAATTTAA AGCGCTGATA TCGATCGCGC 9650 

GCAGATCTGT CATGATGATC ATTGCAATTG GATCCATATA TAGGGCCC6G 9700 

GTTATAATTA CCTCAGGTCG ACGTCCCATG GCCATTCGAA TTCGTAATCA 9750 

TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 9800 

CAACATACGA GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAAT6AG 9850 

TGAGCTAACT CACATTAATT GCGTTGCGCT CACT6CCCGC TTTCCAGTCG 9900 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG 9950 

AGGCGGTTTG CGTATTGGGC GC 9972

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
TAGTAAATTT GGGC

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
AGTAAGATTT GGCC 14



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGTGAAATCT GAAT 14



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATAATTTT GTGT 14



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGTAATATTT GTCT

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

WANMTTTG 8

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCAATTCCAT CATCAATAAT ATACCTTATT TTGGATTGAA GCCAATAT6A 50 

TAATGAGGGG GTGGAGTTTG TGAC6TGGCG CGGGGCGT6G GAACGGGGC6 100 

GGTGACGTAG GTTTTAGGGC GGAGTAACTT GTATGTGTTG GGAATTGTAG 150 

TTTTCTTAAA ATGGGAAGTT ACGTAACGTG GGAAAACGGA AGTGACGATT 200 

TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT TTCTGGGCGT AGGTTCGCGT 250 

GCGGTTTTCT GGGTGTTTTT TGTGGACTTT AACCGTTACG TCATTTTTTA 300 

GTCCTATATA TACTCGCTCT GCACTTGGCC CTTTTTTACA CTGTGACTGA 350 

TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT TTTAATAGGT TTTCTTTTTT 400 
ACTGGTAAGG CTGACTGTTA GGCTGCCGCT
CTG6A6CGGG AGGGT6CTAT TTTGCCTAG6 
TTATGTGTTT TTCTCTCCTA TTAATTTT6T 
TAATGTT6TC TCTAC6CCT6 CG6GTAT6TA 
CTGCAGGTCG ACTCTAGAGG ATC06AAAAA 
AACCTGAAAC ATAAAAT6AA TGCAATTGTT 
A6CTTATAAT GGTTACAAAT AAA6CAATA6 
AAGCATTTTT TTCACTGCAT TCTAGTT6T6 
GTATCTTATC ATGTCTGGAT CCCC6C6GCC 
CCC6GGCTGC AG6AATTCCG TAACATAACT 
CA6TAAA6CA 6TAATATAAT ACAATA6TAA 
T6ATATGTTG T6AAAATGCA 6TAAAACT6A 
AAAT6TTACA 6TGTTGGT6T TAAAACACAA 
AGAGTCCAGT ACCTGGAGAC AATGATGATA 
CTTCAGTTAC ACT6ATTATG ATTTACACTT 
6AACAT6AAA TGAT6TCCAA ATTAT6CTTA 
AGTTTTTATT CAAATATTTT GATA6ATTCA 
A6ATAAAAC6 AAAA6ATTAA AACAAAACTA 
TTTTAGAATG AAACTTAAAA CTTCTTAGTA 
AAATCTT6GT 6AAAACAAAT CCTTGGATAA 
TAAA66A6AG A6A6A6A6AA AA6CAAGACC 
TATCTTA6AG CTTTG66TTT TCTTTT66AA 
ACTGGT6TCC ACACAACA6A CAA6TGGT6A 
ACAATTACTA 6AAACACCCC AAAACCAAAG 
AAGCTGTGPTT TGAT6TTAAT TACAATTAAT 
TAGAAGTTAA TTACACTTGA CGTTAGAGGT 



GTGAAGCGCT 


GTATGTTGTT 


450 


CAGGAGGGTT 


TTTCAGGTGT 


500 


TATACCTCCT 


AT6GGG6CTG 


550 


T^CCCCCCAA 


GCTT6CATGC 


600 


ACCTCCCACA 


CCTCCCCCTG 


650 


GTTGTTAACT 


TGTTTATTGC 


700 


CATCACAAAT 


TTCACAAATA 


750 


GTTT6TCCAA 


ACTCATCAAT 


800 


GCTCTAGAAC 


TAGT6GATCC 


850 


GCGTGCTTTA 


TTGAGATACA 


900 


GGCATATATT 


TGGTGAAATC 


950 


A6TTTAAAAA 


AATAATTA6T 


1000 


TCTATTAT6A 


TACTCAAGTA 


1050 


CATGCCATGT 


GATGATTATG 


1100 


TAATACTTGA 


TGGTTATAAA 


1150 


AAATCAGCAA 


TAAAGCTCTC 


1200 


CTCCAGAACT 


AATATCTAAA 


1250 


TGCACTCTAT 


CTACCTT66A 


1300 


GGAAAGGAAC 


CCCTTGTTTT 


1350 






1400 


AGAACCAAAT 


TTCAATTT6T 


1450 


ATTATAAATG 


AAAAAA6GAA 


1500 


A6TT6TGAAA 


TTA66TGT6C 


1550 


TGAGGTAGAA 


ATAGCATGAG 


1600 


AATGGACAAA 


ACCCACTC6C 


1650 


AACAGATTTG 


CAAAATGATA 


1700 
GGACAGTGAT TTCTATTGAG AGAATGCTCT
ACTGGCATGA GA66A6TAAA GC7ICTTCCTA 
GCACTTTTTC TCCTG6TTCA ATGACTTGCA 
CGTCAACTA6 ACCAGAGA6T TTGGA6AC6C 
AACCACTGTG CCTTCTCACC CACAATCCTG 
AACCAATGCA AAGGA6ACAA ATGCAGTTCA 
CACCAGG6TC ACAAT6TGAT TGGGTTACTT 
CTTGCAGCAT TAAAAAAAAA AATCATCACA 
AAAATCTAAA ATCTAAAATT CATCATCATC 
CAACAACAAA ACCACCCACT TCAGGTTGAG 
ATTTAGTTGT AATTATA6AG ATGTTTATAT 
CCATTCTTTT ACA6AGTTGT TGCTCCCCTC 
CGCAACCTTT AGCTCCTACC ATCTTCCTCC 
TGTCATCTGA TGTTCTATTG CAGAAACATC 
AGGAAGTTGA ATATATCAGC CAACAAATTA 
TATTCATTCG CATGTTCCTT 6AAAAAAATG 
AAAGTTTAAA ACTAGAAACA TCTGGA6CCC 
CGGTAGTCTC CTGGCTTT6G GCTCCAGGGA 
AGATAAGCCC AGATGACTAG AAGCAATTTC 
ATTTGAAGAA GTAACTTCAT ATCTATTTAT 
TATACTT6TA 6ACATATAGA TGTATAAAAT 
ACTCAGTCAA CAATTCTCAA AAGAGCAATA 
GTTCGTATGC AAGAAAATAA AAAAAC6TCA 
CGCTAAAGTA ATGCAAAACA ATGTGCTGCC 
TGTGTGTGTG GTGGGTTCGT GCATGTATGT 
GTGTGTGTGT GTGTGTGTGC GTGTGTGTTT 



TTAAATGCTA AGAAGAAGAA 


1750 


6CAGTCCTTA GCTTTCTGTT 


1800 


TTTGTTTAGA CATTTCA6CC 


1850 


T1JT6CTCTC AAAACTTTCC 


1900 


TGTGGAGTTA CTTGCAGGGA 


1950 


TGGGCTTCTG GACTGATATT 


2000 


TCTTAACAGT AATCCTAA6T 


2050 


AT6AAGAAAA AAAAACCCAA 


2100 


ATCAACAACA ACAACAACAA 


2150 


TTTATGAAGA GGGCAGAACA 


2200 


GTATAGTTGT AAATATTCAT 


2250 


ATATAAATTG ACTGAGGA6C 


2300 


TACTGTCTGG GAGTTAAAAA 


2350 


ATTAAATATA ACCCAACAGT 


2400 


CTATGATAGT AAGTCCTGTG 


2450 


AATCCTCTAG CTCTCAGTGG 


2500 


TAGACAATAT TTTA^TGTGG 


2550 


AAATTCACTC TTGCCCAA6C 


2600 


CATTAGGAAG TGGCAAGAAC 


2650 


CTATATACCT ATAGTATTTA 


2700 


6AAAGCCCAT AGCCAGCCCC 


2750 


TGAA6CA6TC ATTTG6T6G6 


2800 


T6AATTCCAT AT6AATACCA 


2850 


TCAGTGTGTG TGTGTGTGTG 


2900 


6TGC6TGTGT GTGTGTGTGT 


2950 


6TTTAGGGGT TTTTATAAAC 


3000 




AACTTTTTTT ATAAAGCACV CTTTAGTTTA 
TATAAATTTT TAAACAACCC AAAAT6C6TT 
TTATTTA6CT ATCAAGATTT TACAT6TTTT 
TT6CATA6AC 6T6TAAAACC T6CCATT6TT 
AGAAACTACT GAAATCTACA 6TATA6TACC 
A6ATTTTATT TCTTGTAAAC TCTTACT6TC 
ATATTATAAA AACCATGCGG 6AATCA6GAG 
TCCTTCTTCA TCTGTCAT6A CTGAAACTAA 
AATCATCTGC CATGTGGAAA AGGCTTCCTA 
6CTTTCCG6G GGCATTTCTT CCTCTT6AAC 
T6CTCCATCA CTTCTTCTAA CCCTGTGCTT 
AAGATCTTCC TCACCCATAG ATTCTGAAGT 
GCAGCATA66 CT6ACTGCTA TCTGACCTCT 
GACACCGTGG TGCCATTCAC CTTAGCTTCA 
CTGTCTCAGT CTATGTAACT 6AGACTCCAG 
66ATTTGCAT CCT6GCTTCC AGGCGTCCTT 
6CCTCA6CAA TGAGCTCAGC ATCCCT6G6A 
CATCTCAGGA 6GAGAT6GCA GTGGAGACAG 
GCTTCAGGCG ATCATATTCT GCTTGCAGAT 
TCTGCTA6GA TTCTCTCTAG CTCCCCTCTT 
CAAGATCT6G 6CAGGACTAC GAG6CTGGCT 
AACTTTGGCA GTAATGCT66 ATTAACAAAT 
TTAGGAGAGA TGCTATCATT TAGATAAGAT 
TGCTAGCCTG CTAGCATAAT GTTCAATGCG 
AAAGCTGGGG 6GACGAGGCA 6GCGCAGAAT 
AGAGTAACGG GAGTTTCCAT GTTGTCCCCC 



CAATCTCTCT 


TTATAACTGT 


3050 


CCATATAAA6 


AAAT66CAA6 


3100 


CTTTTAACTT TTTT6TACAA 


3150 


AXCAAAACAA 


TAACA6ACTT 


3200 


ACTACCCTTC 


ACAAAAATAT 


3250 


TAATCCTCTT 


TGTT6TA06A 


3300 


TTGTAAAACA 


TTTATTCT6C 


3350 


G6ACTCCATC 


6CTCT6CCCA 


3400 


CATT6T6TCC TCTCTCATT6 


3450 


TAGGGAAG6A 


GTTGTTGA6T 


3500 


GT6TCCTGG6 GAGGACTCAG 


3550 


TT6ACT6CCA ACCACTCG6A 


3600 


6CA6A6AG6T 


G6AAGGAGAG 


3650


GCCTGGGGCT 


6CTCCAGGA6 


3700 


CT6TTTATT6 


T6GTCTTCCA 


3750 


TGTGTT66C6 


CAGTAGCTTA 


3800 


CTCT6A6GA6 AGGT6GGCAT 


3850 


6CCTTTATGC 


TCATGCTGCT 


3900 


TCCTGTTTTC 


TTCCTCAAGA 


3950 


TCCTCACTCT 


CTAAGGAAAT 


4000 



CA6G6G6GA6 


TCCTGGTTCA 


4050 


6TTCATCATC 


TAT6CTCTCA 


4100 


CCATTGCT6T 


TTTCCATTTC 


4150 


T6AATGA6TA 


TCATCGTGTG 


4200 


CTACTG6CCA 


GAAGTTGATC 


4250 


TCTAACACA6 


TCT6CACTG6 


4300 
CAGGTAGCCC ATTCGGGGAT GCTTTCGCAAA
TGTTTTTTAG TACCTTG6C6 AAGTC6CGAA 
GGAGTGCAAT ACTCTACCAT GGG6TAOTGC 
TCGGCCA6AA AAAAAGCAAC TTTGGCAGAT 
GGCTTCTGTA CCTGAATCCA AT6ATT6GAC 
TTGGCTTGAT GCTTGGCAGT TTCA6CA6CA 
CAGCCACACC ATAGACTGGG GTTCCAGGCG 
CAGCTTCAAT CTCA66TTTA TTATTGGCAA 
CTCG6CTCAA T6TTACTGCC CCCAAAG6AA 
TGGGATTTGA ATAGAATCAT GCAGAA6AAG 
AAAA6CCAGT T6AACTTGCC ACTT6CTTGA 
TCCAA6T6T6 CTTTACACA6 A6AAAT6ATG 
ACGGATCCTC CCTGrTTOTC CC6TATCATA 
GACACATATC CACACAGA6A 66GACATT6A 
TCCAGACGAT CATAAATTGT AGTCAAACA6 
CATGG6CT6G TCATTTTGCT TGAG6TT6T6 
CAGCT6ACAG GCTCAA6AGA TCCAA6CAAA 
AGCTTCATGG CA6TCCTATA C6CGGAGAAC 
TAAAGACTGG TAGAGCTCTG TCATTTTGGG 
GGGTCTCGT6 GTTGATATAG TAGGGCACTT 
TCCCAGGGAC CCTGAACTGA AGTGGAAAG6 
AAAGTCCCT6 TGGGCTTCAT GCAGCTGTCT 
CCT6TAGAAG CCTCCATCTG GTATTCA6AT 
TAAGGT6AGA GCTGAAT6CC CAGT6TGGTC 
6ACAC6ATTG ACATTCTCTT TAAGAGGTGC 
TGACTTTTTC AAGGTGATCT TGCAGA6AGT 



ATACCTTTTG 


GTTCGAAATT 


4350 


CATCTTCTCC 


GGATGTAGTC 


4400 


ATTTTATGGC 


CCTTTGCAAC 


4450 


GTcATAATTA 


AAAT6CTTTA 


4500 


ACTCCTTACA 


GATGTTACAC 


4550 


GCCACTCT6T 


GCAAGACGGG 


4600 


CATCCAGTCA 


AGGAAGAGAG 


4650 


ATTGGAAGGA 


GCTCCT6ACA 


4700 


6CAACTTCAC 


CCAACTGTCT 


4750 


ACCCAGCCTA 


CGCTGGTCAC 


4800 


AAAGGTATCT 


GTACTTGTCT 


4850 


CCAGTTTTAA 


AA6ACAGGAC 


4900 


AACATTGAGA 


AGCCAGTT6A 


4950 


CCAGATTGTT 


GT6CTCTTCC 


5000 


TTAATTATCT 


GCAGGATATC 


5050 


CTGGTCCAGG 


6CATCACAT6 


5100 


GGGCCTTCTG 


GAGCCTTCTG 


5150 


CTGACATTAT 


TCAG6TCA6C 


5200 


GTGGTCCCAA 


CAAGTGGTTT 


5250 






5300 


AAGTGCTGGG 


ATGCAGGACC 


5350 


GACACGGTCC 


TCCACAGCCA 


5400 


CTTCCAAAGT 


GCTGAG6TTA 


5450 


AGCTGATGTG 


CAAGGTCATT 


5500 


AATTTCTCCC 


CGAAGTGCCT 


5550 


CAATGAGGAG 


ATCCCCCACT 


5600 
GGCTGCCAGG ATCCCTTGAT CACCTCAGCT
TTCATCG6CA 6CTTCCTGAA 6TTCCT6GAG 
TTTTTCTCT6 CCAATCA6CT 6AGC6CAGGT 
TTGACCTCTT CAGCCTGCTT TC6TAG6AGC 
TTCTTCA6GA 66CA6TTCTC T6G6CTCCTG 
CCAAA6GCT6 CTCTGTCA6A AATATTCTCA 
ATTACAG6TT CTTTAGTTTT CAATTCCCTC 
ATTCTGCTTC T6AACT6CT6 6GAAATCACC 
TCA6TTCATC ATCTTTCAGC T6TAGCCAAA 
A6ATGCAAAC 6CTTCCACTG 6TCA6AACTT 
6TT6A6AGAC TTTTTCTGAA GTTCACTCCA 
AA06TCTTTG TAACA6666T 6CTTCATCC6 
ATTTTTT66C CATTTTCATC AA6ATTGT6A 
AATTTCTCCT TGGAGATCTT GCCATGGTTT 
TGGAGTCTTC TAGGAGCTTC TCCTTACGGG 
GCAGTTGrrr CTGCTTCCGT AATCCA66AA 
AGGGAACT6C T6CAGTAATC TATGAGTTTC 
CACTTACTCT TTTAT6AAT6 TTTCCCCAAG 
ATCATGTGTA CTTTTCTGGT ATCATCAGCA 
CAGTGCCAAA TCATTT6CCA CGTCTACACT 
CTTTGGCCAA CTGCTTGGTT TCTGT6ATCT 
GTGT6AG6AC CTTCTTTCCA T6AGTCAAGC 
6ACCTGTTC6 GCTTCTTCCT TAGCTTCCAG 
ACATTTCATT CAACTGTTGT CTCCT6TTCT 
TCCCACTGAA TCTGAATTCT TTCAATTCGA 
TTCTTGATTG CTGGTTTTGT TTTTCAAATT 



TG6CGCAACT 


TGAGGTCCAG 


5650 


TCTTTCAA6A 


6CTTCATCTA 


5700 


TCAATTT6TC 


CCATTCAG06 


5750 


0(>AGT6ACAT 


TCTGA6CTCT 


5800 


GTAGAGTTTC 


TCTAGTCCTT 


5850 


CAGTCTCCAG 


A6TACTCATG 


5900 


TTGAAG6CCC 


TAT6TATATC 


5950 


AC0GATG66T 


6CCT6ACG6C 


6000 


CAA6AAGTTC 


CT6AAGA6AA 


6050 


6CTTCCAAAT 


GGGACCTAAT 


6100 


CTT6AAATTC 


ATGTTATCGA 


6150 


AACCTTCCA6 


G6ATCTCA66 


6200 


TA6ATATCTG 


T6T6AGTTTC 


6250 


CATCAGCTCT 


CTGACTCCCC 


6300 


AAGCGTCCTG 


TAGGACATTG 


6350 


A6AAACTTCT 


CCAGGTCCA6 


6400 


TTCCAAA6CA 


GCCTCTT6CT 


6450 


AAGTATTGAT 


ATTCTCTGTT 


6500 


GAATAGTCCC 


GAAGAAGTTT 


6550 


TATCTGCCGT 


TGACBGACGJ. 


6600


TCTTTT66AT 


T6CATCTACT 


6650 


TT6CCTCT6A 


CCT6TCCTAT 


6700 


CCATT6TGTT 


GAATCCTTTA 


6750 


GCAGCT6TTC 


TTGAACCTCA 


6800 


TCAGTAATGA 


TTGTTCTA6C 


6850 


CTGGGCAGCA 


GTAAT6A6TT 


6900 




CTTCCAATTG GGGGCGTCTC TGTTCCAAAT CTTGCAGT6T TGCCTTCTGT 6950 

TTGATGATCA TTTCATTGAT GTCTTCCAGA TCACCCACCA TCACTCTCTG 7000 

T6ATTTTATA ACTCGATCAA 6CA6A6ACAG CCAGTCTGTA A6TTCTGTCC 7050 

AAGCTCGGTT GAAGTCTGCC AGTGCAGGTA CCJCCAACAG CAAAGAAGAT 7100 

GGCATTTCTA GTTTGGAGAT GACAGTTTCC TTAGTAACCA CAGATTGTGT 7150 

CACTAGAGTA ACAGTCTGAC TGGCAGAGGC TCCAGTAGTG CTCAGTCCAG 7200 

G6GCACGGTC AGGCTGCTTT GTCCTCAGCT CCCGAAGTAA ATGGTTTACA 7250 

GCCTCCCACT CAGACCTCAG ATCTTCTAAC TTCCTCTTCA CTGGCT6A6T 7300 

GCTTG6TTTT TCCTTATACA AAT6CT6CCC TTT06ACAAA AGCCTTTCCA 7350 

CATCCGCTTG TTTACCGTGA ACTGTTACTT CAATCTCCTT TATGTCAAAC 7400 

GGTCCTGCCT GACTTGGTTG GTTATAAATT TCCAACTGGT TTCTAATAGG 7450 

A6AGACCCAC A6AA6CA66T 6ATCCAGCTG CTCTTCAAGC TGCCTAAAAT 7500 

CTTTTAA6T6 AACCTCAAGC TCTCCTTGTT TCTCAGGTAA A6CTCTG6A6 7550 

ACCTTTATCC ACTGGAGATT TGTCTGTTTG A6CTTCTTTT CAAGTTTATC 7600 

TTGCTCTTCT 6GCCTTATGG GAGCACTTAC AA6TACTGCT CCTCCTGTTT 7650 

CATTTAATTG TTTTAGAATT CCCTGGCGCA GGGGCAACTC TTCTGCCAGT 7700 

AACTTGACTT GTTCAAGTTG TTCTTTTA6C TGCTGCTCAT CTCCAAGTGG 7750 

AGTAATAGCA ATGTTATCTG CTTCTTCCAG CCACAAAACA AATTCATTTA 7800 

AATCTCTTTG AAATTCTGAC AAGACATTCT TTTGTTCTTC AATCCTCTTT 7850 

CTCCTTTCTG CCA6CTCTTT GCAGATGTCG TGCCACCGCA GACTCAAGCT 7900 

TCCTAATTTT TCTTGTAGAA TATTGACATC TGTTTTTGAA GACTGTT6AA 7950 

TTATTTCTTC CCCAGTTGCA TTCAGTGTTC TGACAACAGC TTGACGCTGC 8000 

CCAATGCCAT CCTGGAGTTC CTTAAGATAC CATTTGTATT TAGCATGTTC 8050 

CCAGTTTTCA GGATTTTGTG TCTTTTTGAA AAACTGTTCA ACTTCATTCA 8100 

GCCATTGATT AAATACCTTC ATATCATAAT GAAAGTGTCG CCATTTTTCA 8150 

ACTGATCTGT OGAATCGCCC TTGTCGTTCC TTGTACATTC TATGAAGTTT 8200 
TTCCCCCTGG AAATCCATCT GTGCCACGGC
CCATGGA66T 66CACTTT6C AA66CTGCT6 
TCAATCC6AC CT6A6ATTT6 TT6CAAATT6 
CTCCTCTT6C TTAAAAA6AT CTTCAAAATC 
TATTTAGAAG ATGATCAACT TCTGAAAGAG 
TCGGTCAAAT AAGTAGAA6G CACATAAGAA 
A6TC6TCACT ACCATA6TTT CTTCAT6GA6 
TGAGTCTTCG AAACT6AGCA AAATT6CTCT 
CTGAGCTG6A TCT6AGTTGG CTCCACTGCC 
CAAGCCCTCA 6CTTGCCT6C GCACTGCATT 
GCAATTCAC6 ATCAATTTCC TTTAATTTTC 
A66CTGGCTA ATTTTTTTTC AATTTCATCC 
AGCCT6CCTC TTGTACTGAT ACCACTG6TG 
TTCTTCTTTG AGACCTCAAA TCCTTGAGAG 
AGCTGCT6TT TTATCTTTAT TTCCTCTCGC 
TT6TTGTAA6 TT6TCTCCTC TTTGCAACAA 
T6TCTTCACT GATATCTTTA TTGAA6TCTT 
TGCTGAATTT CAGCCTCGAG T6GTTCAAGC 
AAACTGCTCC AATTCCTTCA AAGGAATGGA 
TGTGAGAAAT AGCTGCAAAT CGACGGTTGA 
ACTACTTTCC TGCAGT6GTC ACCGC66TTT 
GTCACGT6T6 6AGTCCACCT TT6GGC6CAT 
AACGCTTAAG AATGTCTTCC TTTTGTTGTG 
TCTAAAAGTT CATCTGCATG AATGATCCAC 
CTGATCAAAG 6TTTCCATGT GTTTCTGGTA 
ATTCTTCTAC TCTGGAG6TG ACAGCTATCC 



TTCCTGTACT TTCACCTTTT 


8250 


TCTTCTTCTT GTGAATAATA 


8300 


TCTTTTATAT TCTTAA6AGA 


8350 


T^TAGCACAG A6TTCAG6AG 


8400 


CTTGTAAGAT ATGACTGATC 


8450 


ACATCCAAAG GCATATCTTC 


8500 


A6T6TGAATT TGT6CAAA6T 


8550 


CAATTTGCCG CCAGCGCTTG 


8600 


ATTGCGGCCC CATTCTCA6A 


8650 


CAGCTCCTCT TTCTTCTTCT 


8700 


TTTCATCTCT 6GGTTCAGGT 


8750 


AA6CATTTCA G6AGATCATC 


8800 


AGAAATTTCT AGGGCCTTTT 


8850 


CATTATGTTT T6TCTGTAAC 


8900 


TTTCTCTCAT CTGTGATTCT 


8950 


TTCATTTACA GTACCCTCAT 


9000 


CCTCTTTCA6 ATTCACCCCC 


9050 


AATTTTTGTA TATCTGAGTT 


9100 


GGCCTTTCCA GTCTTAATTC 


9150 


GCTCAGAGAT TTGGGGCTCT 


9200 


QCCATCAATT TT6CT6CTT6 


9250 


6TCATTCATT TCAGCCTTTA 


9300 


GTTTCTTCTT TTCAGACTCA 


9350 


TTTGTGATTT GTTCTATGTT 


9400 


TTCCAACAAA AGATTTAGCC 


9450 


A6TTACT6TT CAGAAGACTC 


9500 
AGTTTATCTT CTACCAAGGT TTCTTTCTTG
CTCTCCTAAT TCTGTAACAC TCTTCAAGTG 
CTTTTTGAGT AGCCTTTCCC GAG6CAACTT 
ATTCCTTCAA CTGCTGATCT CTTCGTCAAT 
CCATTCTGTT AAGACATTCA TTTCCTTTCT 
AGCATTTCTC CAACTGTTGC TTTCTCTCTG 
TTGTAATGCA ATTTCAAAGC TGTTACTCGT 
TTCTGTCTGC TTTTTCTGTA GAATTT6A0G 
CCACTTCAGA CTTGACTTCA CTCAG6CTTT 
CTTAGTTGTG ACTGAATTAC TTCCTGTTCA 
AGGCAAATGC ATCTTGACTT CATCTAAAAT 
6TTGTTCAAA ATTGGCTGGT TTTTGGAATA 
TCTTGTAATT TTTTCTGTGC AACATCAATT 
GGCATCCTTC CCCTGGTTAT GTTTCTTCAT 
GACTTGTCAA ATCTGATTGG ATTTTCTGGG 
GCATCCACCT TGTCAGTGAT ATAAGCTGCC 
AAGCGACTCC TGAATTAAGT GCAAG6ACTT 
GGATACTCTG TTCAAGCAAC TTTTGTTTCC 
TCCCTCCAAC GAGAATTAAA CGTCTCAAGC 
CATGACTCCT CCATCTGTAA GAGTCTGTGC 
GGTTCTCCTC TGAATGATGC ATCAGATTTT 
GTGATTTCCT CAGGTCCTGC AGGAACATTT 
TTCTACTTCA TTGAGCCACT TGTTTGCTTT 
CATGCCAACA TGCCCAAACT TCTTCCAAAG 
CTGGTGCACA GCCATTGGTA GTTGGTGGTC 
TAAGGCCTCT TGTGCTGAGG GTGGAGCGTG 



CCCAACACCA 


TTTTCAAAGA 


9550 


AGCCTTCTGT 


TTCTCAATCT 


9600 


CAGAATCCAA ATTACTTGGC 


9650 


TCxGTATCTG 


TTGCTGCCA6 


9700 


CATCTTACGG 


GACAACTTCA 


9750 


TTACCTTCGC 


ACCCAACTCA 


9800 


TCATCAAGCT 


CTTTG6GATT 


9850 


TCC6GTTTTA ATCACCATTT 


9900 


TATACAAGTT 


CACACAAT6A 


9950 


ACACTCTTGG 


TTTCCAATGC 


10000 


CATCTTACTT 


TCCTCTAGAC 


10050 


ATOGAAATTT 


CATGGA6ACA 


10100 


TGTGAAAGAA 


CCCTTTG6TT 


10150 


TTCTTCTAAA 


CTTATCTCAT 


10200 


CTTCCTGAGG 


CATTTGAGCT 


10250 


AACTGCTTGT 


CAATGAATTC 


10300 


TTCAATTTCC 


TGGGCAGACT 


10350 


TCACAGCCTC 


TTCATGTA6T 


10400 


TCCTCATTGA 


TCAGTTCATC 


10450 


CAATAGACGA 


ATCTGATTTG 




CAAGAGATTC 


TAGCACTTCA 


10550 


TCCATGGTTT 


TAAGTTTCAA 


10600 


CTCTAAATAT 


GACAATAACT 


10650 


TTTTGCATTT 


TCCATTCAGC 


10700 


AGAGTTTCAA 


GTTCCTTTTT 


10750 


AGCTATTACA 


CTATTTACAG 


10800 
TCTCAGTAAG GAGTTTCACT TTAGTTTCTT
GCTCTCTTCA TTTCTTCAAC AGCA6TCTGT 
TTCAAAATCT CTCTCTA6AT ATTCTTCTTC 
GCATCTCTGA TA6ATCTTTT TGGAGGCTTA 
TTTAAG6CTT CCTTTCTGGT 6TAGACCTGG 
AGTGTTAA6C TCTCTAAGTT CTGTCTCCAG 
CAGCTTCACT CTTTATCTTC TGCCCACCTT 
G6CT6AATT6 TTT6AATATC ACCAACTAAA 
TTTTTTCAGG ATTTCAGCAT CCCCCAGGGC 
AAACATCAAC TTCAGCCATC CATTTCTGTA 
AATTTTC6AA 6TTTATTCAT ATGTTCTTCT 
CAACT6G6AG 6AAA6TTTCT TCCAGTGCCC 
ACAGATATTT CTG6CATATT TCT6AA66TG 
ACA6TGTCAC TCAGATAGTT GAAGCCATTT 
TTGCAGA6CC TGTAATTTCC CGAGTCTCTC 
TAACACTAA6 ATAAGGTACA GAGAGTTT6C 
GTCCTGATGC TACTCATT6T CTCCT6ATA6 
AAAAATTGTC TGTAGCTCTT TCTCTTTGGC 
GGTTAAAATG ATTAGTAAAG GCCACAAAGT 
CCCTGTCCCT TTTCTTTCAG TTGTAGACTC 
TTGAGGCTGA AGAGCT6ACA ATCTGTT6AC 
ACT6GCTTTT AATTGCTGTT GGCTCTGATA 
AACAAGTTTT CGGCAGTAGT TGTCATCTGT 
ATAAAA6GTA ATGATGTTGG TTTGATACTC 
TCAGCAATTG GCAGAATTCT GTCCACCGGC 
TGTCTGATAC TTTCAGCATT AACACCCTCA 



TTTGTAGTGC 


CTCTTCTTTA 


10850 


AATTCATCT6 


GA6TTTTATA 


10900 


A6CTT6T6TC 


ATCCACTCAT 


10950 


CGGTTTTATC 


CAAACCTGCC 


11000 


CGGCATATGT 


GATCCCACTG 


11050 


TCTGGATGCA 


AACTCAA6TT 


11100 


CATTAACACT 


ATTTAAACTG 


11150 


AGTCTGCATT 


GTTT6AGCTG 


11200 


AGGCCATTCC 


TCTTTCAGGA 


11250 


AGGTTTTTAT 


GTGATTCTGA 


11300 


A6CTTTT66C 


A6CTTTCCAC 


11350 


CTCAATCTCT 


TCAAATTCTG 


11400 


CTTTCTTGGC 


CATCTCCTTC 


11450 


TGTTGCTCTT 


TCAAAGAACT 


11500 


CTCCATTATT 


TCATATTCAG 


11550 


TTTCTGACTG 


CTGGATCCAC 


11600 


CGGATTGGTG 


GTAAAGT6TC 


11650 


CCTCACACCA 


TCAAA6ATGT 


11700 


CTGCATCCAG 


AAACATTGGC 


11750 


T6AATTTTTA 


AxrGCTCAAT 


XXoUU 


TTCATCCTTA 


CAAATTTTTA 


11850 


GGGT6GTAGA 


CT6GGTTTTC 


11900 


TCCAATTGTT 


GTA6CTGATT 


11950 


TAGCCAGTTA 


ACTCTCTCAC 


12000 


TGTTCAGTTG 


TTCTGAA6CT 


12050 


TTTGCCATCT 


GTTCCACCAG 


12100 
GGCCTGAGCT GATCTGCTGC CATCTTGCAG
TTTCTCGTGC TAT66CATTG ACTTTTTCTT 
TCTTTTCGAT A6ACTGCAAA TTCAGAACTC 
A6TAATCCAA CTGTGAAGTT CAGTTATATC 
GTTCAGAATC CACAGTTATC TGCCTCTTCT 
AGTTCCTCTT GGGCATGTTT TACCATGATT 
AGTTACCGTT TCCATTACAG TTGTCTGTGT 
TGACAGCCTG T6AAATTTGT GCTGAACTCT 
TTGTCCCAAC GTTGTGCAAA GTTTTCCATC 
CACTGACTTA TTTTTCAGTG CCGAAAGTAG 
GTTTTTCCAT GGTTGGCTTT TTCTTTTCTA 
ATTTTGTGAA GACTT6ACAT CATTTCATTT 
CTGAATGTTC TTCATTGCAT CTTCTTTTTC 
GGCACTGTTC TTCAGTAAAA TGCTGCCATT 
ACAATCCAGC GGTCTTCAGT CCATCTGCAG 
CAGTACCTTA AGTTGTTCTT CCAAAGCAGC 
ATTCATCAAC CACTACTACC ATGTGAGTGA 
TCCTGTTCTA GATCTTCTTG AAGCACCTTA 
TAGATCTTCA AGATCAGGTC CAAAGGGCTC 
TCTCTTCAGT TTTTGTTAAC CAGTCATCTA 
TGGAGATCCA TTAGAACTTT GTGTAATTTG 
TACCCTGAGA CATTCCCATC TTGAATTTAG 
CTTCAGCTTC TTCATCTTCT GATAATTTCC 
CCTAACTGTA GAACATTACC AACAAGTCCT 
CATGAATCCC TCATGAGCAT GAAACTGTTC 
TTGAAATCTC TCCTTGTGCT CGCAATGTAT 



TTTTCTGAAC 


TTCTCTGCTT 


12150 


GCAAGTCTGA 


GATGTTGCCT 


12200 


TGTAATACAG 


CTTCTGAACG 


12250 


GACATCCAAC 


CTTTTCCT6A 


12300 


TTTGAGGAGG 


TGGTGGTGGA 


12350 


TGTTCCCTTG 


TG6TCACCAT 


12400 


TAGGGATGGT 


TGAGTGGTGG 


12450 


TTTCAAGTTT TT6GGTTAAA 


12500 


CAGATTTCCA 


TCTTTTGAGT 


12550 


ATCTTGATTG 


AGTGAACTTA 


12600 


6ATCTATTTT 


TAAAGTAGAT 


12650 


TGATCTTTAA AGCCACTTGT 


12700 


T6AAAGCCAT 


GTACTAAAAA 


12750 


TTAGAAGAAT 


ATCTTGTAAA 


12800 


ATATTTGCCC 


ATCGATCTCC 


12850 


TGTTGCATGA 


TCACCGCTGG 


12900 


GCGAGTTGAC 


CCTGACCTGC 


12950 


TGTTGTTGTA 


CTTGGCATTT 


13000 


TTCCTCCATT 


TTCTTAGTTC 


13050 


GTTCTTTTAA 


TTTCTGATTC 


13100 


CTTTGTTTTT 


CCATGCTAGC 


13150 


GAGATTCATT TGTTCTTGCA 


13200 


CTTTTCCAAC 


TAGTTGACTT 


13250 


TGATGAGATG 


TCAGATCCAT 


13300 


TTTCACTTCT 


TCAACATCAT 


13350 


CCTCGGCAGA AAGAAGCCAT 


13400 
GAAAGTACTT CTTCTAAAGC AGTTTGGTAA
CTCCATCAAT GAACTGTCAA GT6ACTTGTC 
6T6AA66ATA 6666CTCT6T 6T66AATCAG 
T6TGT6AA6G CATAACTCTT 6AATC6A6GC 
TTCATA6CCC TGTGCTA6AC TGACTGTGAT 
GGTGATGTAA TTGAAAATGT TCTTCTCTAG 
G6GAACATTT CCACTTCTTG AATGGCTTCA 
AACTTGAAAG AGTGAT6T6A TGTACATTAA 
AA6TGGTAGC AACATCTTCA GGATCAA6AA 
CATTTTGCAA TGTTGAAGGC ATGTTCCAGT 
TGAAACCACA CTATTCCAAT CAAACAGGTC 
6A6CATTCAA A6CCAACCC6 TCG6ACCAGC 
TTAACCTGTG GATAATTACG TGTTGACTGT 
CTTTTCACTG TTGGTTT6CT GCAATCCAGC 
TTTTGACCTG CCAGTGGAGG ATTATATTCC 
TTAT6ATTTC CATCCACTAT 6TCAGTGCTT 
ATTATTTTTC T6TAAGACCC GCAGT6CCTT 
GAACTCTT6T AGATCCCTTT TCTTTT6GCA 
TCCAAGAGGT CTAGGAGGCG TTTTCCATCC 
GTCTATGTGT TGCTTTCCAA ACTTAGAAAA 
TGAATGTTTT CTTTTGAACA TCTTCTCTTT 
TCCCACCAAA GCATTTGGAA 6AAAAAGTAT 
ATCTTGGTAA AA6TTTCTCC CAGTTTTATT 
GATGAGAAGC CAATAAACTT CAGCAGCCTT 
TAGCACTTCA A6TCTTCCTA TTCGTTTTTT 
AGAGCGGAAT TCCTGCA6CC CGGGGGATCC 



CTATCCAGAT 


TTACTTCCGT 


13450 


TCT6GGAGCT 


TCCAAATGCT 


13500 


AG6TG6CAAC 


ATAA6CA6CC 


13550 


OTiAGGAGATG 


AAGAA6TTTG 


13600 


CTGTT6A6A6 


TAATGCATCT 


13650 


TTACTTTT6A 


AGATGTCCTG 


13700 


AT6CTCACTT 


6TTGTGGCAA 


13750 


6AT6GACTTC 


TTGTCTGGAT 


13800 


GTTTTTCTAT 


GCCTAACT6G 


13850 


CTTTGGGTGG 


CTGAGTGCTG 


13900 


GGGCCTGTGA 


CTATGGATAA 


13950 


TA6AG6TGAA 


6TTGAT6AC6 


14000 


C6AACCGA6C 


TCA6AAGAAT 


14050 


CAT6ATAGTT 


TTCATCACAT 


14100 


AAATCAAACC 


AAGAGTGA6T 


14150 


CCTATATTCA 


CTAAATCAAC 


14200 


6TTGACATT6 


TTCAGGGCAT 


14250 


6TTTTT6CCC 


TGTAA6GCCT 


14300 


TGCAGGTCAC 


TGAA6AGGTT 


14350 




#^x wwnx X X X w 


14AOO 


CATAACA6TC 


CTCTACTTCT 


14450 


ATATCAAGGC 


AGGGATAAAA 


14500 


GCTCCA6GAG 


GCTTAGGTAC 


14550 


GACAAAAAAA 


AAAAAAAAAA 


14600 


CTATAAAGCT 


ATTGCCTTCA 


14650 


ACTAGTTCTA 


GAGCG6CCGC 


14700 
GGGTACAATT CCGCAGCTTT TAGAGCAGAA GTAACACTTC CGTACAGGCC 14750

TAGAAGTAAA 66CAACATCC ACT6AGGAGC AGTTCTTTGA TTTGCACCAC 14800 

CACC66ATCC GGGACCTGAA ATAAAAGACA AAAA6ACTAA ACTTACCA6T 14850 

TAACTTTCT6 GTTTTTCACT TCCTOGAGTA CCuGATCCTC TA6AGTCC66 14900 

AGGCTG6ATC GGTCCCGGTG TCTTCTATGG AG6TCAAAAC AGCGTG6AT6 14950 

GCGTCTCCAG GCGATCTGAC GGTTCACTAA ACGAGCTCTG CTTATATAGA 15000 

CCTCCCACCG TACAC6CCTA CCGCCCATTT GCGTCAATGG GGCGGAGTTG 15050 

TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC 15100 

ATTGACGTCA ATGGGGTG6A GACTTGGAAA TCCCCGTGAG TCAAACC6CT 15150 

ATCCACGCCC ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG 15200 

ATGACTAATA CGTAGAT6TA CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT 15250 

6TACTGGGCA TAATGCCAGG 0G6GCCATTT ACCGTCATTG ACGTCAATAG 15300 

GGGGCGTACT TGGCATAT6A TACACTTGAT GTACTGCCAA GTGGGCAGTT 15350 

TACCGTAAAT ACTCCACCCA TT6ACGTCAA TGGAAAGTCC CTATTGGC6T 15400 

TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG 15450 

6CGGTCAGCC AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACCTGCAGG 15500 

TCGACTCTAG AGGATCTCCC TAGACAAATA TTACGCGCTA TGAGTAACAC 15550 

AAAATTATTC AGATTTCACT TCCTCTTATT CAGTTTTCCC GCGAAAATGG 15600 

CCAAATCTTA CTCGGTTACG CCCAAATTTA CTACAACATC CGCCTAAAAC 15650 

CGCGCGAAAA TTGTCACTTC CTGTGTACAC CGGCGCACAC CAAAAACGTC 15700 

ACTTTTGCCA CATCCGTCGC TTACATGTGT TCCGCCACAC TTGCAACATC 15750 

ACACTTCCGC CACACTACTA C6TCACCCGC CCOGTTCCCA CGCCCCGCGC 15800 

CACGTCACAA ACTCCACCCC CTCATTATCA TATTGGCTTC AATCCAAAAT 15850 

AAGGTATATT ATTGAT6ATG CTAGCGGGGC CCTATATATG GATCCAATTG 15900 

CAATGATCAT CATGACAGAT CTGCGCGCXSA TCGATATCAG CGCTTTAAAT 15950 

TTGCGCATGC TA6CTATAGT TCTAGAGGTA CCGGTTGTTA ACGTTAGCCG 16000 
GCTACGTATA CTCCGGAATA TTAATAGGCC
6CC6CCT6CA GCTGGCGCCA TCGATAC6CG 
6TACAGA6CT C6A6AA6TAC TA6TG6CCAC 
TT66CACT66 CC6TC6TTTT ACAACGTCGT 
TACCCAACTT AATCGCCTTG CAGCACATCC 
ATAGCGAAGA GGCCCGCACC GATCGCCCTT 
AAT6G06AAT G6C6CCT6AT 6CG6TATTTT 
TATTTCACAC CGCATAC6TC AAA6CAACCA 
CXSCATTAAGC 6CGGCGGGTG TGGTGGTTAC 
TTGCCAGCGC CCTAGCGCCC GCTCCTTTCG 
GCCACGTTCG CCGGCTTTCC CC6TCAA6CT 
AGGGTTCC6A TTTAGTGCTT TAOGGCACCT 
TGG6TGAT66 TTCACGTA6T GGGCCATC6C 
CCTTTGACGT TGGAGTCCAC GTTCTTTAAT 
TGGAACAACA CTCAACCCTA TCTCGGGCTA 
TTTTGCC6AT TTOGGCCTAT TGGTTAAAAA 
TTTAA06C6A ATTTTAACAA AATATTAACG 
TCTCA6TACA ATCT6CTCT6 ATGCCGCATA 
CGCCAACACC C6CTGACGCG CCCTGACGG6 
GCTTAGAGAC AAGCTGTGAC CGTCTCCG6G 
TTCACC6TCA TCACC6AAAC 6C6C6AGAC6 
TATTTTTATA 6GTTAAT6TC AT6ATAATAA 
GGCACTTTTC 6GGGAAAT6T GC6CGGAACC 
AATACATTCA AATATGTATC CGCTCATGAG 
TTCAATAATA TT6AAAAA6G AA6AGTATGA 
GCCCTTATTC CCTTTTTTGC G6CATTTTGC 



TAGGATGCAT 


ATGGCGGCCG 


16050 


TAC6TCGCGA 


CCGCGGACAT 


16100 


GTG66CCGTG 


CACCTTAA6C 


16150 


GAwTGGGAAA 


ACCCT6GC6T 


16200 


CCCTTTCGCC 


AGCTGGCGTA 


16250 


CCCAACA6TT 


GCGCAGCCTG 


16300 


CTCCTTACGC 


ATCT6T6C66 


16350 


TA6TACGC6C 


CCT6TAGCG6 


16400 


GCGCAGCGT6 


ACCGCTACAC 


16450 


CTTTCTTCCC 


TTCCTTTCTC 


16500 


CTAAATCGGG 


6GCTCCCTTT 


16550 


CGACCCCAAA 


AAACTTGATT 


16600 


CCT6ATA6AC 


G6TTTTTC6C 


16650 


AGTGGACTCT 


TGTTCCAAAC 


16700 


TTCTTTTGAT 


TTATAAGGGA 


16750 


ATGAGCT6AT 


TTAACAAAAA 


16800 


TTTACAATTT 


TATG6T6CAC 


16850 


6TTAAGCCA6 


CCCCGACACC 


16900 


CTTGTCTGCT 


CCCGGCATCC 


16950 




VX WnVAww A X 


17O0O 

A 9 WWW 


AAA6GGCCTC 


6TGATAC6CC 


17050 


TGGTTTCTTA 


GAC6TCA66T 


17100 


CCTATTTGTT 


TATTTTTCTA 


17150 


ACAATAACCC 


TGATAAATGC 


17200 


6TATTCAACA 


TTTCCGTGTC 


17250 


CTTCCTGTTT 


TTGCTCACCC 


17300 
AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCAGTTG GGTGCACGAG 17350

TGGGTTACAT CGAACTGGAT CTCAACA6CG GTAAGATCCT TGAGAGTTTT 17400 

CGCCCC6AAG AACGTTTTCC AAT6AT6AGC ACTTTTAAA6 TTCTGCTAT6 17450 

TG606CGGTA TTATCC06TA TTGACGCCG6 6CAAGAGCAA CTCGGTOGCC 17500 

GCATACACTA TTCTCAGAAT GACTTGGTT6 A6TACTCACC A6TCACAGAA 17550 

AA6CATCTTA C6GAT6GGAT GACAGTAAGA GAATTATGCA GTGCTGCCAT 17600 

AACCATGA6T GATAACACT6 OGGCCAACTT ACTTCTGACA ACGATCGGAG 17650 

6ACC6AAGGA GCTAACC6CT TTTTTGCACA ACATGG666A TCATGTAACT 17700 

C6CCTT6ATC GTTGG6AACC G6AGCTGAAT GAAGCCATAC CAAACGACGA 17750 

GCX3TGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT 17800 

TAACT6GCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG 17850 

ATGGAGGCG6 ATAAAGTT6C A66ACCACTT CTGCGCTC66 CCCTTCCGGC 17900 

T6GCT6GTTT ATT6CTGATA AATCTGGAGC C6GT6AGCGT G66TCTCGCG 17950 

GTATCATTGC AGCACT6GGG CCAGATGGTA AGCCCTCCCG TATCGTAGTT 18000 

ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 18050 

C6CTGAGATA G6TGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 18100 

TTTACTCATA TATACTTTAG ATT6ATTTAA AACTTCATTT TTAATTTAAA 18150 

A6GATCTA6G T6AA6ATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA 18200 

ACGTGAGTTT TCGTTCCACT 6A6C6TCA6A CCCC6TA6AA AA6ATCAAAG 18250 

GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG CTTGCAAACA 18300 

AAAAAACCAC CGCTACCA6C GGTGGTTTGT TTPCCGGATC AAGAGCTACC 18350 

AACTCTTTTT CCGAA66TAA CTGGCTTCA6 CAGAGC6CAG ATACCAAATA 18400 

CTGTTCTTCT AGTGTAGCC6 TAGTTAGGCC ACCACTTCAA GAACTCT6TA 18450 

6CACCGCCTA CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 18500 

CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA CGATAGTTAC 18550 

CGGATAA6GC 6CA6CGGTC6 GGCTTGAACGG GGGGTTCGTG CACACA6CCC 18600 
AGCTTGGAGC GAACGACCTA CACCGAACTG
ATGA6AAA6C GCCAC6CTTC CC6AA6GGA6 
TAA606GCA6 6GTCGGAACA GGA6AGC6CA 
AAC6CCTG6T ATCTTTATAG TCCT6TC666 
6C6TCGATTT TT6TGATGCT C6TCAGGGG6 
CCA6CAACGC GGCCTTTTTA CGGTTCCTG6 
CACATGTTCT TTCCT6C6TT ATCCCCT6AT 
C6CCTTT6A6 TGA6CT6ATA CC6CTC6CCG 
6C6AGTCA6T 6A6C6AGGAA GC66AA6A6C 
CTCX:CCGCGC GTTGGCCGAT TCATTAATGC 
CGACTGGAAA GC666CA6TG AGCGCAA06C 
CTCATTAG6C ACCCCA66CT TTACACTTTA 
T6TGGAATT6 TGAGOSGATA ACAATTTCAC 
TGATTACGAA TTC6AATGGC CATGGGACGT 
CCGG6CC 



A6ATACCTAC 


AGCGT6AGCT 


18650 


AAA6GC6GAC 


A6GTATCC6G 


18700 


CGAGGGAGCT 


TCCAGGGGGA 


18750 


TTa'CGCCACC 


TCTGACTTGA 


18800 


GCGGAGCCTTA 


TGGAAAAACG 


18850 


CCTTTTGCTG 


6CCTTTTGCT 


18900 


TCTGTGGATA 


ACCGTATTAC 


18950 


CA6CCGAAC6 


ACCGAGOGCA 


19000 


6CCCAATACG 


CAAACCGCCT 


1 O ARA 


A6CTGGCACG 


ACAGGTTTCC 


19100 


AATTAATGT6 


AGTTAGCTCA 


19150 


TGCTTCCGGC 


TCGTATGTT6 


19200 


ACAGGAAACA 


6CTAT6ACCA 


19250 


CGACCTGAGG 


TAATTATAAC 


19300 
19307 
WHAT IS CLAIMED IS:

1. A recombinant shuttle vector comprising: 

(a) the DNA sequences of, or corresponding to,
a portion of the genome of an adenovirus which comprises
DNA sequences of, or corresponding to, the adenovirus 5'
and 3' inverted terminal repeats and packaging/enhancer 
domain necessary for replication and virion encapsidation 
in the absence of sequence encoding viral genes; 

(b) a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
operatively linked to the DNA of (a) and capable of 
expression in a target cell in vivo or in vitro. 

2. The vector according to claim 1 wherein said 
DNA sequences (a) comprise the native adenovirus 5'
inverted terminal repeats and packaging sequences. 

3. The vector according to claim 1 wherein said
DNA sequences (a) comprise the native adenovirus 3'
inverted terminal repeat sequences.

4. The vector according to claim 1 wherein said 
selected gene (b) is a reporter gene. 

5. The vector according to claim 4 wherein said 
reporter gene is selected from the group consisting of 
the genes encoding β-galactosidase, alkaline phosphatase
and green fluorescent protein. 

6. The vector according to claim 1 wherein said 
selected gene (b) is a therapeutic gene. 

7. The vector according to claim 6 wherein said
therapeutic gene is selected from the group consisting of 
a normal CFTR gene, a DMD Becker allele and a normal LDL 
gene. 

8. A crippled adenovirus helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification
reduces the packaging efficiency of said virus, said 
virus also containing selected adenovirus genes necessary 
to direct a productive viral infection. 

9. The helper virus according to claim 8 wherein 
said modified sequence comprises: 

i. a fragment of adenovirus map units 0-1; 

ii. a fragment of (i) containing a 5' inverted
terminal repeat and between one to four selected
packaging sequences;

iii. a modified fragment of (i) containing at 
least one PAC consensus sequence in place of at least one 
native PAC sequence; and 

iv. a modified fragment of (ii), wherein said 
native PAC sequences are mutated to contain modified 
sequences.

10. The virus according to claim 8 wherein said 
modified sequence comprises Ad5 base pairs 1-269. 

11. The virus according to claim 8 wherein said 
sequence (ii) comprises Ad5 base pairs 1-321. 

12. The virus according to claim 8 wherein said 
helper adenovirus is conjugated to a poly-cation
sequence.

13. A method for producing a recombinant adenovirus
which comprises transfecting a selected host cell with

(a) a recombinant shuttle vector comprising 

i. the DNA sequences of, or 
corresponding to, a portion of the genome of an
adenovirus which comprises adenovirus 5' and 3' cis-
elements necessary for replication and virion
encapsidation in the absence of sequence encoding viral 
genes; and 

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a 
target cell in vivo or in vitro; and 

(b) a helper adenovirus comprising sufficient 
adenovirus gene sequences necessary for a productive
viral infection, wherein said transfected host cell 
permits the formation of a recombinant virus comprising 
the DNA of (i) and (ii) in an adenoviral capsid, and 

isolating and purifying the recombinant virus from
said cell. 

14. The method according to claim 13, wherein said 
helper virus is a crippled helper virus comprising a 
modified adenovirus sequence in place of native 
adenovirus sequence map units 0-1, which modification 
reduces the packaging efficiency of said helper virus, 
said helper virus also containing selected adenovirus 
genes necessary to direct a productive viral infection.

15. The method according to claim 13 wherein said
helper adenovirus is associated with a poly-cation
sequence.

16. The method according to claim 13 wherein said
vector is associated with said helper adenovirus 
conjugate in a single particle. 

17. The method according to claim 13 wherein said
helper virus is an adenovirus sequence containing
deletions of all or portions of the E1a and E1b genes.

18. The method according to claim 13 wherein said 
helper virus is an adenovirus sequence containing 
deletions of all or a portion of the E3 gene. 

19. A recombinant adenovirus comprising 

i. the DNA of, or corresponding to, a 
portion of the genome of an adenovirus which comprises 
adenovirus 5' and 3' cis-elements necessary for
replication and virion encapsidation in the absence of
sequence encoding viral genes;

ii. a selected gene operatively linked to 
regulatory sequences directing its expression, said gene 
linked to the DNA of (a) and capable of expression in a 
target cell in vivo or in vitro; 

said DNA and gene encapsidated in an adenoviral 
capsid.

20. The virus according to claim 19 wherein said 
viral capsid is a capsid of an adenovirus serotype 
selected from the group consisting of types 2, 4, 5, 7, 
12 and 40. 

21. The virus according to claim 19 wherein said 
selected gene is a CFTR gene, a DMD gene and an LDL gene.

22. The use of a recombinant adenovirus according
to claim 19 for the manufacture of a pharmaceutical 
composition suitable for delivering and integrating a 
selected gene into the chromosome of a target cell. 

FIGURE 3A

GAACTCGAGC AGCTGAAGCT TGAATTCCAT CATCAATAAT ATACCTTATT 50

TTGGATTGAA GCCAATATGA TAATGAGGGG GTGGAGTTTG TGACGTGGCG 100

CGGGGCGTGG GAACGGGGCG GGTGACGTAG GTTTTAGGGC GGAGTAACTT 150

GTATGTGTTG GGAATTGTAG TTTTCTTAAA ATGGGAAGTT ACGTAACGTG 200

GGAAAACGGA AGTGACGATT TGAGGAAGTT GTGGGTTTTT TGGCTTTCGT 250

TTCTGGGCGT AGGTTCGCGT GCGGTTTTCT GGGTGTTTTT TGTGGACTTT 300

AACCGTTACG TCATTTTTTA GTCCTATATA TACTCGCTCT GCACTTGGCC 350

CTTTTTTACA CTGTGACTGA TTGAGCTGGT GCCGTGTCGA GTGGTGTTTT 400 

TTTAATAGGT TTTCTTTTTT ACTGGTAAGG CTGACTGTTA GGCTGCCGCT 450 

GTGAAGCGCT GTATGTTGTT CTGGAGCGGG AGGGTGCTAT TTTGCCTAGG 500

CAGGAGGGTT TTTCAGGTGT TTATGTGTTT TTCTCTCCTA TTAATTTTGT 550

TATACCTCCT ATGGGGGCTG TAATGTTGTC TCTACGCCTG CGGGTATGTA 600 

TTCCCCCCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCGAAAAA 650

ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT 700

GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG 750 

CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG 800 

GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CCCCGCGGCC 850 

GCCTAGAGTC GAGGCCGAGT TTGTCAGAAA GCAGACCAAA CAGCGGTTGG 900

AATAATAGCG AGAACAGAGA AATAGCGGCA AAAATAATAC CCGTATCACT 950 

TTTGCTGATA TGGTTGATGT CATGTAGCCA AATCGGGAAA AACGGGAAGT 1000 

AGGCTCCCAT GATAAAAAAG TAAAAGAAAA AGAATAAACC GAACATCCAA 1050

AAGTTTGTGT TTTTTAAATA GTACATAATG GATTTCCTTA CGCGAAATAC 1100 

GGGCAGACAT GGCCTGCCCG GTTATTATTA TTTTTGACAC CAGACCAACT 1150 

GGTAATGGTA GCGACCGGCG CTCAGCTGTA ATTCCGCCGA TACTGACGGG 1200 

CTCCAGGAGT CGTCGCGACC AATCCCCATA TGGAAACCGT CGATATTCAG 1250 

CCATGTGCCT TCTTCCGCGT GCAGCAGATG GCGATGGCTG CTTTCCATCA 1300 

GTTGCTGTTG ACTGTAGCGG CTGATGTTGA ACTGGAAGTC GCCGCGCCAC 1350 

FIGURE 3C

FIGURE 3D

AGTATCGGCC TCAGGAAGAT CGCACTCCAG CCAGCTTTCC GGCACCGCTT 4000 

CTGGTGCCGG AAACCAGGCA AAGCGCCATT CGCCATTCAG GCTGCGCAAC 4050

TGTTGGGAAG GGCGATCGGT GCGGGCCTCT TCGCTATTAC GCCAGCTGGC 4100

CAAAGGGGGA TGTGCTGCAA GGCGATTAAG TTGGGTAACG CCAGGGTTTT 4150 

CCCAGTCACG ACGTTGTAAA ACGACGGGAT CGCGCTTGAG CAGCTCCTTG 4200

CTGGTGTCCA GACCAATGCC TCCCAGACCG GCAACGAAAA TCACGTTCTT 4250 

GTTGGTCAAA GTAAACGACA TGGTGACTTC TTTTTTGCTT TAGCAGGCTC 4300

TTTCGATCCC CGGGAATTGC GGCCGCGGGT ACAATTCCGC AGCTTTTAGA 4350

GCAGAAGTAA CACTTCCGTA CAGGCCTAGA AGTAAAGGCA ACATCCACTG 4400

AGGAGCAGTT CTTTGATTTG CACCACCACC GGATCCGGGA CCTGAAATAA 4450 

AAGACAAAAA GACTAAACTT ACCAGTTAAC TTTCTGGTTT TTCAGTTCCT 4500 

CGAGTACCGG ATCCTCTAGA GTCCGGAGGC TGGATCGGTC CCGGTCTCTT 4550

CTATGGAGGT CAAAACAGCG TGGATGGCGT CTCCAGGCGA TCTGACGGTT 4600

CACTAAACGA GCTCTGCTTA TATAGACCTC CCACCGTACA CGCCTACCGC 4650 

CCATTTGCGT CAATGGGGCG GAGTTGTTAC GACATTTTGG AAAGTCCCGT 4700 

TGATTTTGGT GCCAAAACAA ACTCCCATTG ACGTCAATGG GGTGGAGACT 4750

TGGAAATCCC CGTGAGTCAA ACCGCTATCC ACGCCCATTG ATGTACTGCC 4800 

AAAACCGCAT CACCATGGTA ATAGCGATGA CTAATACGTA GATGTACTGC 4850

CAAGTAGGAA AGTCCCATAA GGTCATGTAC TGGGCATAAT GCCAGGCGGG 4900 

CCATTTACCG TCATTGACGT CAATAGGGGG CGTACTTGGC ATATGATACA 4950 

CTTGATGTAC TGCCAAGTGG GCAGTTTACC GTAAATACTC CACCCATTGA 5000 

CGTCAATGGA AAGTCCCTAT TGGCGTTACT ATGGGAACAT ACGTCATTAT 5050 

TGACGTCAAT GGGCGGGGGT CGTTGGGCGG TCAGCCAGGC GGGCCATTTA 5100

CCGTAAGTTA TGTAACGACC TGCAGGTCGA CTCTAGAGGA TCTCCCTAGA 5150 

CAAATATTAC GCGCTATGAG TAACACAAAA TTATTCAGAT TTCACTTCCT 5200 

CTTATTCAGT TTTCCCGCGA AAATGGCCAA ATCTTACTCG GTTACGCCCA 5250 
AaaAAWw V X


^ X w^lx* X A\H9i» 


AAAA 1 Ala wi^ 








xAvAGCGCTT 


CACAGCGGCA 




CAGCCTTACC 


2U0 


AGXAAAAAiiG 


AAAACwTATT 


AAAAAAACAC 


CACTCGACZAC 


GGGACCAGCT 


290 


ClUiTCAGTCA 


CAGTGTAAAA 


AAGGGCCAAG 


TGCAGAGCGA 


GTATATATAG 


300 




TGACGTAACG 


GttAAAGTCC 


ACAAAAAACA 


#^/%Mlk ^ Ik H Ik % ^ 

CCCAGAAAAC 


OCA 

390 




^/ Mil « f>f^r^ K 
CCTACGCCCA 


GAAACGAAAG 


CCAAAAAACC 


^Ik ^Ik Ik /WflNII^^ 

CACAACTTCC 


400 




Awx X wCGX 1 i. 


TCCCACG^TA 


CGTAAC^TCC 


Ik fffwvwnm Ik ik/*% 
CAXXllAAGA 


A C A 

490 


AAAWlAWiAT 


TwWCAACACA 


TACAAG!raAC 


TCCGCCCTAA 


AACCTACGTC 


CA A 

900 


AWwWlvwWWWv 


X XwUvAuwCw 


wCGCGCCACG 


TCAwAAAwTC 


wACCwCwTCA 


99U 


X lAX WiXAl X 


IraVl XCAAXw 


CAAAATAAGG 


TATATTATTG 


ATGATGCTAG 


OUU 




AATATACCXT 


ATTTTGGATT 


GAAGCCAATA 


TGATAATGAG 


^ C A 

O90 


GGGGTGGAGT 


TOGTGACGTG 


6C6CG66GCG 


TG6GAACG6G 


GCGGGTGACG 


"V A A 

700 


TAGTAGTGT6 


GCGGAAGTGT 


GATGTT6GAA 


GTGTGGCGGA 


ACACATGTAA 


*v e A 

750 


GCGACGGAT6 


TGGCAAAA6T 


6AC6TTTTT6 


6T6TGCGCCG 


6T6TACACAG 


800 


GAAGTGACAA 


TTTTCGCGCG 


GTTTTAGGCG 


GATGTTGTAG 


TAAATTTGGG 


850 


CGTAACCGAG 


TAAGATTTGG 


CCATTTTCGC 


GG6AAAACTG 


AATAAGAGGA 


900 


AGT6AAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


950 


GGGAGATCAG 


CCTGCAGGTC 


GTTACATAAC 


TTAC6GTAAA 


TGGCCCGCCT 


1000 


6GCT6ACCGC 


CCAACGACCC 


CCGCCCATT6 


AC6TCAATAA 


TGACGTATGT 


1050 


TCCCATAGTA 


ACGCCAATAG 


GGACTTTCCA 


TTGACGTCAA 


TGGGTGGAGT 


1100 


ATTTACGGTA 


AACTGCCCAC 


TTG6CA6TAC 


ATCAA6T6TA 


TCATATGCCA 


1150 


AGTACX^CCCC 


CTATTGACGT 


CAATGACGGT 


AAATG6CCCG 


CCTGGCATTA 


1200 


TGCCCAGTAC 


ATGACCTTAT 


GG6ACTTTCC 


TACTTGGCAG 


TACATCTACG 


1250 


TATTAGTCAT 


CGCTATTACC 


ATGGTGATGC 


GGTTTT6GCA 


6TACATCAAT 


1300 



FIGURE 5B 





AlvwGGX XXajA 


CTvACGGGGA 


TTTCCAAGTC 


TCCACCCwAT 






^^^^^^ ^^fn^Rf^ti^^fR 


TTTGGCACCA 


AAATCAACGG 


0* H ^VfWVWI^^ Ik Ik 

GACTXXCCAA 


X4QD 




wwVCTCCGww 


CCATTGACGC 


AAATGGGCGG 


TAGGCGTGTA 


^ A RA 


^» Ira X wUVj Al9Vi 


X wX A X AT AAG 


CAGAGwTwGT 


TTA6TGAACC 


GTCAGATwGw 


xouu 


CTvmAGACGC 


CATCCACGCT 


GTTTTGACCT 


CCATAGAAGA 


CACCGGGACC 


« RRA 


GATCCAGCCT 


CCGGACTCTA 


GAGGATCCGG 


TACTCGAGGA 


ACTGAAAAAC 


1600 


CaGaaaGTTa 


ACTGGTAAGT 


TTAGTCTTTT 


TGTCTTTTAT 


TTCAGGTCCC 


1650 


GGATCCGGTG 


GTGGTGCAAA 


TCAAAGAACT 


GCTCCTCAGT 


GGhTGTTGCC 


1700 


TTTACTTCTA 


GGCCTGTACG 


GAAGTGTTAC 


TTCTGCTCTA 


AAAGCTGCGG 


1750 


AATTGTACCC 


6CGGCCGCAA 


TTCCCGG6GA 


TCGAAA6A6C 


CTGCTAAAGC 


1800 


AAAAAAGAAG 


TCACCAT6TC 


GTTTACTTTG 


ACCAACAAGA 


ACGTGATTTT 


1850 


CGTTGCC6GT 


CTGGGA6GCA 


TTG6TCTGGA 


CACCAGCAAG 


6AGCTGCTCA 


1900 


AGCGCGATCC 


CGTCGTTTTA 


CAACGTCGTG 


ACTGGGAAAA 


CCCTGGCGTT 


1950 


ACCCAACTTA 


ATCGCCTTGC 


AGCACATCCC 


CCTTTCGCGA 


GCTGGCGTAA 


2000 


TA6CGAA6AG 


6CCCGCACCG 


ATCGCCCTTC 


CCAACAGTTG 


CGCAGCCTGA 


2050 


ATGGC6AATG 


GCGCTTTGCC 


TGGTTTCC66 


CACCAGAAGC 


6GTGCCGGAA 


2100 


AGCTG6CT66 


AGTGCGATCT 


TCCTGAG6CC 


GATACTGTCG 


TCGTCCCCTC 


'2150 


AAACTGGCAG 


ATGCACGGTT 


ACGATGCGCC 


CATCTACACC 


AACGTAACCT 


2200 


ATCCCATTAC 


GGTCAATCCG 


CCGTTTGTTC 


CCAC66A6AA 


TCCGACGGGT 


2250 


T6TTACTCGC 


TCACATTTAA 


TGTTGAT6AA 


A6CTG6CTAC 


AGGAAGGCCA 


2300 


GACGCGAATT 


ATTTTTGAT6 


GCGTTAACTC 


GGC6TTTCAT 


CTCTGGTGCA 


2350 


ACGGGCGCTG 


GGTCGGTTAC 


GGCCAGGACA 


GTCGTTTGCC 


GTCTGAATTT 


2400 


GACCTGAGCG 


CATTTTTACG 


CGCCGGAGAA 


AACCGCCTCG 


CGGTGATGGT 


2450 


GCTGCGTTGG 


AGTGACG6CA 


GTTATCTGGA 


AGATCA66AT 


ATGTGGCGGA 


2500 


TGAGCGGCAT 


TTTCC6T6AC 


6TCTCGTTGC 


TGCATAAACC 


GACTACACAA 


2550 


ATCAGCGATT 


TCCATGTTGC 


CACTCGCTTT 


AATGATGATT 


TCAGCCGCGC 


2600 
TGTACTGGAG


GCTGAAGTTC 


AGATGTGCGG 


CGAGTT6C6T 


GACTACCTAC 


2650 


GGGTAACAGT 


TTCTTTATG6 


CAG66T6AAA 


CGCAGGTCGC 


CA6CGGCACC 


2700 


6C6CCTTTCC 


6C66T6AAAT 


TATC6AT6A6 


C6T66TG6TT 


AT6CCGATCG 


2750 


C6TCACACTA 


CGTCTGAACG 


TCGAAAACCC 


6AAACTGTGG 


AGCGCCGAAA 


2800 


TCCCGAATCT 


CTATC6TGCG 


GTGGTTGAAC 


TGCACACCGC 


CGACGGCACG 


2850 


CTGATTGAA6 


CAGAAGCCT6 


CGATGTCGGT 


TTCCGCGAGG 


TGCGGATT6A 


2900 


AAATGGTCT6 


CTGCTGCT6A 


ACGGCAA6CC 


6TT6CTGATT 


CGAGGCGTTA 


2950 


ACC6TCAC6A 


GCATCATCCrr. 


CTGCATG6TC 


AG6TCATGGA 


TGAGCA6ACC 


3000 


AT66TGCAG6 


ATATCCTGCT 


6ATGAAGCAG 


AACAACTTTA 


ACGCCGTGCG 


3050 


CT6TTCGCAT 


TATCC6AACC 


ATCCGCT6TG 


GTACACGCTG 


TGCGACCGCT 


3100 


ACG6CCT6TA 


T6TG6TG6AT 


GAAGCCAATA 


TTGAAACCCA 


CG6CATGGT6 


3150 


CCAAT6AATC 


GTCT6ACCGA 


TGATCCGCGC 


TGGCTACCGG 


CGATGAGC6A 


3200 


AC6C6TAAC6 


CGAATGGTGC 


AGCGCGATCG 


TAATCACCCG 


AGTGTGATCA 


3250 


TCTGCTCGCT 


G66GAAT6AA 


TCAGGCCACG 


6C6CTAATCA 


CGACGC6CTG 


3300 


TATC6CTGGA 


TCAAATCTGT 


CGATCCTTCC 


CGCCCG6TGC 


AGTATGAAGG 


3350 


CG6CX3GA6CC 


6ACACCACGG 


CCACCGATAT 


TATTTGCCC6 


AT6TACGCGC 


3400 


GC6TGGATGA 


AGACCAGCCC 


TTCCCGGCTG 


TGCCGAAATG 


GTCCATCAAA 


3450 


AAATGGCTTT 


C6CTACCTGG 


AGAGACGCGC 


CCGCTGATCC 


TTTGCGAATA 


3500 


CGCCCAC6C6 


ATGGGTAACA 


GTCTTGGCGG 


TTTCGCTAAA 


TACTGGCAGG 


3550 


CGTTTC6TCA 


GTATCCCCGT 


TTACAGGGCG 


GCTT0GTCT6 


GGACTGGGTG 


3600 


GATCA6TCGC 


TGATTAAATAs T6ATGAAAAC 


GGCAACCCGT 


GGTCGGCTTA 


3650 


CGGCGGTGAT 


TTTGGCGATA 


CGCCGAACGA 


TCGCCAGTTC 


TGTATGAACG 


3700 


GTCTGGTCTT 


TGCCGACCGC 


ACGCCGCATC 


CAGCGCTGAC 


GGAAGCAAAA 


3750 


CACCAGCAGC 


AGTTTTTCCA 


GTTCCGTTTA 


TCCGGGCAAA 


CCATCGAAGT 


3800 


GACCA6C6AA 


TACCT6TTCC 


GTCATAGCGA 


TAACGAGCTC 


CTGCACT66A 


3850 


TGGTGGCGCT 


GGATGGTAA6 


CCGCTGGCAA 


GCGGTGAAGT 


GCCTCTGGAT 


3900 
GTCGCTCCAC AAGGTAAACA GTTGATTGAA CTGCCTGAAC TACCGCAGCC 3950

GGAGA6CGCC GGGCAACTTCT GGCTCACAGT ACGCGTAGTG CAACCGAACG 4000 

CGACCGCATG GTCAGAAGCC GGGCACATCA GCGCCTGGCA GCAGTGGCGT 4050 

CTGGCGGAAA ACCTCAGTGT GACGCTCCCC GCPGCGTCCC ACGCCATCCC 4100 

GCATCTGACC ACCAGCGAAA TGGATTTTTG CATCGAGCTG GGTAATAAGC 4150 

GTTGGCAATT TAACCGCCAG TCAGGCTTTC TTTCACAGAT GTGGATTGGC 4200 

GATAAAAAAC AACTGCTGAC GCCGCTGCGC GATCAGTTCA CCCGTGCACC 4250 

GCTGGATAAC GACATTGGCG TAAGTGAAGC GACCCGCATT GACCCTAACG 4300 

CCTGGGTCGA ACGCTGGAAG GCGGCGGGCC ATTACCAGGC CGAAGCAGOG 4350 

TTGTTGCAGT GCACGGCAGA TACACTTGCT GATGCGGTGC TGATTACGAC 4400 

CGCTCACGCG TGGCAGCATC AGGGGAAAAC CTTATTTATC AGCCGGAAAA 4450 

CCTACCGGAT TGATGGTAGT 6GTCAAATGG CGATTACCGT TGATGTTGAA 4500 

6TGGCGAGC6 ATACACCGCA TCCG6CG0GG ATTGGCCTGA ACTGCCAGCT 4550 

GGCGCAGGTA GCAGAGCG6G TAAACT6GCT C6GATTAGGG CCGCAAGAAA 4600 

ACTATCCC6A CCGCCTTACT GCCGCCTGTT TTGACCGCTG GGATCTGCCA 4650 

TTGTCAGACA TGTATACCCC GTACGTCTTC CCGAGCGAAA ACGGTCTGCG 4700 

CTGCGGGACG CGCGAATTGA ATTATGGCCC ACACCAGTGG CGCGGCGACT 4750 

TCCA6TTCAA CATCAGCCGC TACAGTCAAC AGCAACTGAT GGAAACCAGC 4800 

CATCGCCATC TGCTGCACGC GGAAGAAGGC ACATGGCTGA ATATCGACGG 4850 

TTTCCATATG GGGATTGGTG GCGACGACTC CTGGAGCCCG TCAGTATCGG 4900 

CGGAATTACA GCTGAGCGCC GGTCGCTACC ATTACCAGTT GGTCTGGTGT 4950 

CAAAAATAAT AATAACCGGG CAGGCCAT6T CTGCCCGTAT TTCGCGTAAG 5000 

GAAATCCATT ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT 5050 

TATTCTTTTT CTTTTACTTT TTTATCATGG GAGCCTACTT CCCGTTTTTC 5100 

CCGATTTGGC TACATGACAT CAACCATATC AGCAAAAGTG ATACGGGTAT 5150 

TATTTTTGCC GCTATTTCTC TGTTCTCGCT ATTATTCCAA CCGCTGTTTG 5200 

GTCTGCTTTC TGACAAACTC GGCCTCGACT CTAGGCGGCC GCGGGGATCC 5250 
AGACATGATA AGATACATTG ATGAGTTTGG ACAAACCACA ACTAGAATGC


5300 


A6T6AAAAAA AT6CTTTATT TGT6AAATTT GTGATGCTAT TGCTTTATTT 


5350 


GTAACCATTA TAAGCTGCAA TAAACAAGTT AACAACAACA 


ATTGCATTCA 


5400 


TTTTATGTTT CAGGTTCAGG GGGAGGTGTG GGAGGTTTTT 


TCGGATCCTC 


5450 


TAGAGTCGAC GACGCGAGGC TGGAT6GCCT TCCCCATTAT 


GATTCTTCTC 


,5500 


GCTTCCGGCG GCATCGGGAT 6CCCGC6TTG GAGGCCATGC 


TGTCCAGGCA 


e p e M 

5550 


G6TAGATGAC GACCATCA6G GACAGCTTCA AGGATCGCTC 


GCGGCTCTTA 


5600 


CCAGCCTAAC TTCGATCACT GGACCGCTGA TCGTCACGGC 


GATTTATGCC 


5650 


GCCTCGGCGA GCACATGGAA CGGGTTGGCA TGGATTGTAG 


GCGCCGCCCT 


5700 


ATACCTTGTC TGCCTCCCCG C6TT6CGTCG CGGTGCATG6 


AGCC6GGCCA 


5750 


CCTCGACCTG AATGGAA6CC GG0G6CACCT C6CTAAC66A 


TTCACCACTC 


5800 


GAAGAATT6G AGCCAATCAA TTCTTGCGGA GAACTGTGAA 


TGCGCAAACC 


5850 


AACCCTTGGC AGAACATATC CATCGC6TCC GCCATCTCCA 


GCAGCCGCAC 


5900 


GCGGCGCATC TCGGGCAGC6 TTGG6TCCTG GCCACGGGTG 


CGCATGATCG 


5950 


TGCTCCTGTC GTT6AGGACC CGGCTAGGCT 66CGGGGTTG 


CCTTACTGGT 


6000 


TAGCAGAATG AATCACCGAT ACGCGAGCGA ACGTGAAGCG ACTGCT6CTG 


6050 


CAAAACGTCT GCGACCTGAG CAACAACATG AATGGTCTTC 


GGTTTCCGTG 


6100 


TTTCGTAAAG TCTGGAAACG CGGAAGTCAG CGCCCTGCAC 


CATTATGTTC 


6150 


CGGATCTGCA TCGCAGGATG CTGCTGGCTA CCCTGTGGAA 


CACCTACATC 


6200 


TGTATTAACG AAGCCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 


6250 


GT6TAGGTC6 TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 


6300 


GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG 


TCCAACCCGG 


6350 


TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA 


CA66ATTAGC 


6400 


AGA6CGAGGT ATGTAGGCGG TGCTACAGA6 TTCTTGAAGT GGTGGCCTAA 


6450 


CTACGGCTAC ACTAGAAGGA CAGTATTT6G TATCTGC6CT 


CTGCTGAA6C 


6500 


CA6TTACCTT C66AAAAAGA GTTGGTAGCT CTTGATCCGG 


CAAACAAACC 


6550 
ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 6600

AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG 6650

CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT GAGATTATCA 6700

AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 6750

AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA 6800

TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT 6850

GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC 6900

TGGCCCCAGT GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG 6950

ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT 7000

CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC 7050

TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT GTTGCCATTG 7100

CTGCAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC 7150

TCCGGTTCCC AACGATCAAG GCGAGTTACA TCATCCCCCA TGTTGTGCAA 7200

AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG 7250

CCGCAGTGTT ATCACTCATG GTTATGCCAG CACTGCATAA TTCTCTTACT 7300

GTCATGCCAT CCGTAAGATG CTTTTCTGTG ACTGGTGAGT ACTCAACGAA 7350

GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT TGCCCGGCGT 7400

CAACACGGGA TAATACCGCG CCACATAGCA CAACTTTAAA AGTGCTCATC 7450

ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT 7500

GAGATCCAGT TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT 7550

CTTTTACTTT CACCAGCGTT TCTGGGTGAG CAAAAACAGG AAGGCAAAAT 7600

GCCGCAAAAA AGGGAATAAG GGCGACACGG AAATGTTGAA TACTCATACT 7650

CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA 7700

GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 7750

CGCACATTTC CCCGAAAAGT GCCACCTGAC GTCTAAGAAA CCATTATTAT 7800

CATGACATTA ACCTATAAAA ATAGGCGTAT CACGAGGCCC TTTCGTCTTC 7850

AA 7852
TCTTCCGCTT


CCTC6CTCAC 


TGACTCGCT6 


CGCTCG6TC6 


TTC66CTGC6 


50 


GC6A6C66TA 


TCAGCTCACT 


CAAAGGCGGT 


AATACXXSTTA 


TCCACAGAAT 


100 


CA6GGGATAA 


C6CAGGAAAG 


AACATGTGAG 


CAAAAGGCCA 


GCAAAAGGCC 


150 


A6GAACC6TA 


AAAAG6CCGC 


GTTGCT66CG 


TTTTTCCATA 


GGCTCCGCCC 


200 


CCCT6AC6A6 


CATCACAAAA 


ATC6ACGCTC 


AAGTCAGAGG 


TGGCGAAACC 


250 


CGACAGGACT 


ATAAAGATAC 


CAG6CGTTTC 


CCCCT66AAG 


CTCCCTCGTG 


300 


CGCTCTCCTG 


TTCC6ACCCT 


6CC6CTTACC 


GGATACCTGT 


CCGCCTTTCT 


350 


CCCTTCGGGA 


AGCGT6GC6C 


TTTCTCATAG 


CTCACGCTGT 


AGGTATCTCA 


400 


GTTCGGTGTA 


6GTCGTTC6C 


TCCAAGCTGG 


6CTGTGTGCA 


C6AACCCCCC 


450 


GTTCAGCCC6 


ACC6CTGCGC 


CTTATCC6GT 


AACTATCGTC 


TTGAGTCCAA 


500 


CCC66TAAGA 


CAC6ACTTAT 


C6CCACTGGC 


A6CAGCCACT 


6GTAACAG6A 


550 


TTAGCAGAGC 


GAGGTATGTA 


GGCGGTGCTA 


CAGAGTTCTT 


GAAGTGGTGG 


600 


CCTAACTAC6 


6CTAGACTAG 


AAGAACAGTA 


TTTGGTATCT 


6CGCTCTGCT 


650 


GAA6CCA6TT 


ACCTTCGGAA AAA6A6TTGG 


TAGCTCTTGA 


TCCGGCAAAC 


700 


AAACCACCGC 


TGGTA6C6GT 


GGTTTTTTTG 


TTTGCAAGCA 


GCA6ATTACG 


750 


CGCAGAAAAA 


AAGGATCTCA A6AA6ATCCT 


TTGATCTTTT 


CTACGGGGTC 


800 


TGACGCTCAG 


TGGAACGAAA 


ACTCACGTTA 


AGGGATTTTG 


GTCATGAGAT 


850 


TATCAAAAAG 


GATCTTCACC 


TAGATCCTTT 


TAAATTAAAA 


ATGAAGTTTT 


900 


AAATCAATCT 


AAAGTATATA 


TGAGTAAACT 


TGGTCTGACA 


GTTACCAAT6 


950 


CTTAATCAGT 


GAG6CACCTA 


TCTCAGCGAT 


CTGTCTATTT 


CGTTCATCCA


1000 


TAGTTGCCTG 


ACTCCCCGTC 


6TGTAGATAA 


CTACGATACG 


GGAGGGCTTA 


1050 


CCATCTGGCC 


CCA6TGCTGC 


AATGATACCG 


CCAGACCCAC 


GCTCACCGGC 


1100 


TCCAGATTTA 


TCAGCAATAA 


ACCAGCCAGC 


CGGAAGGGCC 


GAGCGCAGAA 


1150 


6T6GTCCTGC 


AACTTTATCC 


GCCTCCATCC 


A6TCTATTAA 


TTGTTGCC6G 


1200 


GAAGCTA6AG 


TAAGTAGTTC 


GCCAGTTAAT 


AGTTTGCGCA 


ACGTTGTTGC 


1250 


CATTGCTACA 


GGCATCGTGG 


TGTCACGCTC 


GTCGTTTGGT 


ATGGCTTCAT 


1300 
TTCCCAACGA


TCAAGGCGAG 


TTACATGATC 


CCCCATGTTG 


1350 


Wf^f*^ A A & % & ^ 

TGwVAAAAAG 


C66TTA6CTC 


CTTCGGTCCT 


CCGATCGTT6 


TCA6AA6TAA 


1400 


GTTGGCCGCA 


GTGTTATCAC 


TCATGGTTAT 


GGCAGCACTG 




1450 




6CCATCC6TA A6AT6CTTTT 


CTGTGACTGG 


T6A6TACTCA 


1500 


ACCAAGTCAT 


TCTGA6AATA 


GT6TATGC6G 


06ACC6A6TT 


GCTCTTGCCC 


1550 


GGC6TCAATA 


C6GGATAATA 


CC6CGCCACA TAGCA6AACT 


TTAAAA6TGC 


1600 


TCATCATT66 


AAAAC6TTCT 


TCG6GGCGAA 


AACTCTCAAG 


GATCTTACCG 


1650 


CTGTTGAGAT 


CCAGTTCGAT 


GTAACCCACT 


CGTGCACCCA 


ACTGATCTTC 


1700 


AGCATCTTTT 


ACTTTCACCA 


GCGTTTCTGG 


6TGAGCAAAA 


ACAGGAAGGC 


1750 


AAAAT6CCGC AAAAAAG66A ATAA6GGTOA CAC66AAATG 


TTGAATACTC 


1800 


ATACTCTTCC 


TTTTTCAATA 


TTATT6AA6C 


ATTTATCAG6 


GTShTTGTCT 


1850 


CATGA6CG6A 


TACATATTTG 


AATGTATTTA 


GAAAAATAAA 


CAAATAGGGG 


1900 


TTCCGCGCAC 


ATTTCCCCGA 


AAAGTGCCAC 


CTGACGTCTA 


A6AAACCATT 


1950 


ATTATCATGA 


CATTAACCTA 


TAAAAATA6G 


C6TATCACGA 


66CCCTTT06 


2000 


TCTOGCGCGT 


TTCGGTGAT6 


AC6GTGAAAA 


CCTCTGACAC 


ATGGAGCTCC 


2050 


C6GAGACXK3T 


CACAGCTTGT 


CTGTAA6CG6 


ATGCCGGGAG 


CA6ACAA6CC 


2100 


CGTCAGGGC6 


CGTCAGCGGG 


TGTTGGCGGG 


T6TCGGGGCT 


GG CTTAACTA 


2150 


TGCGGCATCA 


GAGCAGATTG 


TACTGA6A6T 


GCACCATAAA 


ATTGTAAACG 


2200 


TTAATATTTT 


6TTAAAATTC 


GC6TTAAATT 


TTT6TTAAAT 


CAGCTCATTT 


2250 


TTTAACCAAT A6GCC6AAAT 


CGGGAAAATC 


CCTTATAAAT 


CAAAAGAATA 


2300 


6CCCGAGATA 


66GTTGAGTG 


TTGTTCCAGT 


TTGGAACAAG 


AGTCCACTAT 


2350 


TAAAGAAC6T 


GGACTCCAAC 


GTCAAAGGGC 


GAAAAACCGT 


CTATCAGGGC 


2400 


GATGGCCCAC 


TACGTGAACC 


ATCACCCAAA 


TCAAGTTTTT 


TGGGGTCGAG 


2450 


GTGCCGTAAA 


GCACTAAATC 


GGAACCCTAA 


AGGGAGCCCC 


CGATTTAGAG 


2500 


CTTGACGGG6 


AAAGCCGGCG 


AACGTG6CGA 


GAAAGGAA6G 


GAAGAAAGCG 


2550 


AAAGGAGCGG 


GCGCTAGGGC 


GCT6GCAAGT 


GTAGCGGTCA 


CGCTGCGCGT 


2600 
l9W X ilVilVVJV W








fV^T E *P<* /"WIT 




V VAWiW A X \> W 


\9 X AAlvu AV AA 


2700


AATACCGCAT 


CAGGCGuCAT 


TCuCCATTCA 




C X \* X 1 IvVauiWV 


2750


GGGCGATCGG 


TGCGGGCCTC 


TTCGCTATTA 


CGCCAGCTGG 


CGAAAGGGGG 


2800


ATGTGCTGCA 


AGGCGATTAA 


GTTGGGTAAC 


GCCAGGGTTT 


TCCGAGTCAC 


2850 


GACGTTGTAA 


AACGACG6CC 


AGT6CCAA6C 


TTAAGGTGCA 


CGGCCCACGT 



2900 


GGCCACTA6T 


ACTTCTCGA6 


CTCT6TACAT 


6TCCGCG6TC 


GCGACGTACG 



2950 


CGTATCGATG 


GCGCCAGCTG 


CAGGCGGCCG 


CCATATGCAT 


CCTAGGCCTA 


3000 


TTAATATTCC 


6GA6TATACG 


TA6CCGGCTA 


AC6TTAACAA 


CCGGTACCTC 


3050


TAGAACTATA 


GCTAGCCAAT 


TCCATCATCA 


ATAATATACC 


TTATTTTGGA 




CTGAAGCCAA 


TATGATAATG 


AGGGGGTGGA 


6TTTGTGAC6 


TGGCGvGGGG 


3150


CGTGGGAACG 


GGGCGGGTGA 


CGTAGGTTTT 


AGGGCGGAGT 


AACTTGTATG 


3200


TGTTGGGAAT 


TGTAGTTTTC 


TTAAAATGG6 


AAGTTACGTA 


Ik ^^IW^^^ Ik Ik ?k 

ACGTGGGAAA 


3250


ACGGAAGTGA 


CGATTTGAG6 


AAGTTGTGGG 


TTTTTTGGCT 


TTOGTTTCTC 


3300


GGCGTA6GTT 


CGCGTGCGGT 


TTTCTGGGTG 


TlVriTrGl'GG 


ACTTTAACCG 



3350 


TTACGTCATT 


TTTTAGTCCT 


ATATATACTC 


6CTCTGCACT 


TGGCCCTTTT 



3400 


TTACACTGTG 


ACTGATTGAG 


CTGGTGCCGT 


GTCGAGTGGT 


G'x-x-i-^rrrTAA 


3450 


TA66TTTTCT 


TTTTTACTGG 


TAAGGCT6AC 


TGTTAGGCTG 


CCGCTGTGAA 



3500 


GCGCTGTATG 


TTGTTCTGGA 


GCGGGAGGGT 


6CTATTTT6C 


CTAGGCAGGA 


3550


GGGTTTTTCA 


GGTGTTTATG 


TGTTTTTCTC 


TCCIATIMT 


TTTGTTATAC 


3600 


CTCCTAT6GG 


GGCTGTAATG 


TTGTCTCTAC 


GCCTGCGGGT 


ATGTATTCCC 


3650 


CCCAAGCTTG 


CATGCCTGCA 


GGTCGACTCT 


AGAGGATCCG 


AAAAAACCTC 


3700 


CCACACCTCC 


CCCTGAACCT 


6AAACATAAA 


ATGAATGCAA 


TTGTTGTTGT 


3750 


TAACTTGTTT 


ATTGCAGCTT 


ATAATGGTTA 


CAAATAAAGC 


AATAGCATCA 


3800 


CAAATTTCAC 


AAATAAAGCA 


rrrrriTCAC 


TGCATTCTAG 


TTGTGGTTTG 


3850 


TCCAAACTCA 


TCAATGTATC 


TTATCATGTC 


TGGATCCCCC 


TAGCTTGCCA 


3900 
AACCTACAGG


TGG6GTCTTT 


CATTCCCCCC 


TTTTTCT6GA 


6ACTAAATAA 


3950 


AATCTTTTAT 


TTTATCTAT6 


GCTC6TACTC 


TATAGGCTTC 


AGCTGGT6AT 


4000 


ATTGTTGA6T 


CAAAACTA6A 


GCCTGGACCA 


CTGATATCCT 


GTCTTTAACA 


4050 


AATTG6ACTA 


ATCGCGGGAT 


CA6CCAATTC 


CATGAGCAAA 


TGTCCCATGT 


4100 


CAACATTTAT 


GCTGCTCTCT 


AAAGCCTTGT 


ATCTTGCATC 


TCTTCTTCTG 


4150 


TCTCCTCTTT 


CAGAGCA6CA 


ATCTG66GCT 


TAGACTTGCA 


CTTGCTTGA6 


4200 


TTCCGGTGG6 


6AAAGA6CTT 


CACCCTGTCG 


6A6GGGCT6A 


TGGCTTGCCG 


4250 


GAAGAGGCTC 


CTCTCGTTCA 


GCAGTTTCTG 


GATGGAATCG 


TACTGCC6CA 


4300 


CTTTGTTCTC 


TTCTAT6ACC 


AAAAATTGTT 


GGCATTCCAG 


CATTGCTTCT 


4350 


ATCCTGTGTT 


CACA6A6AAT 


TACTGTGCAA 


TCA6CAAAT6 


CTT6TTTTA6 


4400 


A6TTCTTCTA 


ATTATTT6GT 


ATGTTACT66 


ATCCAAAT6A 


6CACTGG6TT 


4450 


CATCAAGCAG 


CAAGATCTTC 


GCCTTACTGA 


GAACA6ATCT 


A6CCAAGCAC 


4500 


ATCAACT6CT 


T6TG6CCATG 


GCTTAG6ACA 


CA6CCCCCAT 


CCACAAGGAC 


4550 


AAA6TCAAGC 


TTCCCAG6AA 


ACTGTTCTAT 


CACAGATCTG 


A6CCCAACCT 


4600 


CATCTGCAAC 


TTTCCATATT 


TCTTGATCAC 


TCCACTGTTC 


ATA666ATCC 


4650 


AA6TTTTTTC 


TAAAT6TTCC 


AGAAAAAATA 


AATACTTTCT 


6TGGTATCAC 


4700 


TCCAAAGGCT 


TTCCTCCACT 


GTTGCAAAGT 


TATTGAATCC 


CAAGACACAC 


4750 


CATC6ATCTG 


GATTTCTCCT 


TCAGTGTTCA 


6TAGTCTCAA 


AAAAGCTGAT 


4800 


AACAAA6TAC 


TCTTCCCTGA 


TCCAGTTCTT 


CCCAAGA66C 


CCACCCTCTG 


4850 


GCCA6GACTT 


ATT6A6AA66 


AAAT6TTCTC 


TAATATG6CA 


TTTCCACCTT 


4900 


CT6TGTATTT 


TGCTGTGAGA 


TCTTTGACAG 


TCATTTGGCC 


CCCTGAGGGC 


4950 


CAGATGTCAT


CTTTCTTCAC 


GTGTGAATTC 


TCAATAATCA 


TAACTTTCGA 


5000


GAGTTG6CCA 


TTCTTGTAT6 


GTTTGGTTGA 


CTTGGTAGGT 


TTACCTTCTG 


5050 


TT66CATGTC 


AATGAACTTA 


AAGACTC6GC 


TCACAGATCG 


CATCAAGCTA 


5100 


TCCACATCTA 


TGCTGGAGTT 


TACAGCCCAC 


TGCAATGTAC 


TCATGATATT 


5150 


CATGGCTAAA 


GTCAGGATAA 


TACCAACTCT 


TCCTTCTCCT 


TCTCCTGTTG 


5200 
TTAAAATGGA AATGAAGGTA


ACAGCAAT6A 


A6AAGATGAC 


AAAAATCATT 


5250 


TCTATTCTCA 


TTTGGAACCA 


6CGCA6TGTT 


6ACAGGTACA 


A6AACCAGTT 


5300 


66CA6TAT6T 


AAATTCAGAG 


CTTT6TG6AA 


CAGA6TTTCA 


AAGTAAG6CT 


5350 


GCC6TCC6AA 


6GCACGAAGT 


GTCCaVTAGTC 


CTTTTAAGCT 


TGTAACAAGA 


5400 


TGAGT6AAAA 


TTGGACTCCT 


GCCTTCA6AT 


TCCAGTTGTT 


TGAGTTGCTG 


5450 


TGAGGTTT66 


AGGAAATATG 


CTCTCAACAT 


AATAAAAGCC 


ACTATCACT6 


5500 


GCACT6TTGC 


AACAAAGATG 


TAGGGTTGTA 


AAACTGCGAC 


AACTGCTATA 


5550 


GCTCCAATCA 


CAATTAATAA 


CAACT6GATG 


AAGTCAAATA 


TGGTAAGAGG 


5600 


CAGAAGGTCA 


TCCAAAATTG 


CTATATCTTT 


GGAGAATCTA 


TTAAGAATCC 


5650 


CACCTGCTTT 


CAACGTGTT6 


AGGGTTGACA 


TAGGTGCTTG 


AAGAACAGAA 


5700 


TGTAACATTT 


TGTGGTGTAA 


AATTTTC6AC 


ACTGTGATTA 


GAGTATGCAC 


5750 


CAGTG6TAGA 


CCTCT6AAGA 


ATCCCATAGC 


AAGCAAA6TG 


TCGGCTACTC 


5800 


CCACGTAAAT 


GTAAAACACA 


TAATACGAAC 


TGGTGCTGGT 


GATAATCACT 


5850 


GCATA6CT6T 


TATTTCTACT 


AT6AGTACTA 


TTCCCTTTGT 


CTTGAAGAGG 


5900 


A6T6TTTCCA 


AGGAGCCACA 


GCACAACCAA 


AGAAGCAGCC 


ACCTCTGCCA 


5950 


GAAAAATTAC 


TAA6CACCAA 


ATTAGCACAA 


AAATTAA6CT 


CTTGTGGACA 


6000 


6TAATATATC 


GAA6GTATGT 


6TTCCAT6TA 


GTCACTGCT6 


GTATGCTCTC 


6050 


CATATCATCA 


AAAAAGCACT 


CCTTTAAGTC 


TTCTTCGTTA 


ATTTCTTCAC 


6100 


TTATTTCCAA 


6CCAGTTTCT 


TGAGATAACC 


TTCTT6AATA 


TATATCCAGT 


6150 


TCAGTCAAGT 


TTGCCTGAGG 


GGCCAGT6AC 


ACTTTTC6TG 


TGGATGCTGT 


6200 


TGTCTTTC66 


T6AATGTTCT 


6ACCTTG6TT 


aActgagtgt 


GTCATCAG6T 


6250 


TCAGGACAGA 


CTGCCTCCTT 


CGTGCCTGAA 


GCGTGGGGCC 


AGTGCTGATC 


6300 


ACGCTGATGC 


GAGGCAGTAT 


CGCCTCTCCC 


TGCTCAGAAT 


CTGGTACTAA 


6350 


GGACAGCCTT 


CTCTCTAAAG 


GCTCATCAGA 


ATCCTCTTCG 


ATGCCATTCA 


6400 


TTTGTAA6G6 


AGTCTTTTGC 


ACAAT6GAAA 


ATTTTCGTAT 


AGAGTTGATT 


6450 


GGATTGAGAA 


TAGAATTCTT 


CCrrTTTTCC 


CCAAACTCTC 


CAGTCTGTTT 


6500 
AAAAGATTGT


TTTTTTGTTT 


CT6TCCAGGA 


GACAGGAGCA 


TCTCCTTCTA 


6550 


AT6AGAAAC6 


GTGTAAG6TC 


TCAGTTAG6A 


TTGAATTTCT 


TCTTTCTGCA 


6600 


CTAAATTGGT 


CGAAAGAATC 


ACATCCCATG 


AGTTTTGAGC 


TAAAGTCTGG 


6650 


CTGTA6ATTT 


TGGAGTTCTG 


AAAAT6TCCC 


ATAAAAATAG 


CTGCTACCTT 


6700 


CAT6CAAAAT 


TAATATTTTG 


TCA6CTTTCT 


TTAAATGTTC 


CATTTTAGAA 


6750 


6T6ACCAAAA 


TCCTAGTTTT 


GTTAGCCATC 


AGTTTACAGA 


CACAGCTTTC 


6800 


AAATATTTCT 


TTTTCTGTTA 


AAACATCTA6 


GTATCCAAAA 


GGAGAGTCTA 


6850 


ATAAATACAA 


ATCAGCATCT 


TTGTATACTG 


CTCTTGCTAA 


AGAAATTCTT 


6900 


GCTC6TTGAC 


CTCCACTCAG 


TGTGATTCCA 


CCTTCTCCAA 


GAACTATATT 


6950 


GTCTTTCTCT 


GCAAACTTGG 


AGAT6TCCTC 


TTCTAGTTGG 


CATGCTTTGA 


7000 


TGACGCTTCT 


GTATCTATAT 


TCATCATAGG 


AAACACCAAA 


GATGATATTT 


7050 


TCTTTAATGG 


TGCCAGGCAT 


AATCCA6GAA 


AACTGAGAAC 


AGAATGAAAT 


7100 


TCTTCCACTG 


TGCTTAATTT 


TACCCTCT6A 


AGGCTCCAGT 


TCTCCCATAA 


7150 


TCATCATTAG 


AAGTGAAGTC 


TT6CCT6CTC 


CAGTGGATCC 


AGCAACCGCC 


7200 


AACAACTGTC 


CTCTTTCTAT 


CTTGAAATTA 


ATATCTTTCA 


GGACAGGAGT 


7250 


ACCAAGAAGT 


GAGAAATTAC 


T6AAGAA6A6 


GCTGTCATCA 


CCATTAGAAG 


7300 


TTTTTCTATT 


GTTATTGTTT 


TGTTTTGCTT 


TCTCAAATAA 


TTCCCCAAAT 


7350 


CCCTCCTCCC 


AGAAGGCTGT 


TACATTCTCC 


ATCACTACTT 


CTGTAGTCGT 


7400 


TAAGTTATAT 


TCCAATGTCT 


TATATTCTTG 


CTTTTGTAAG 


AAATCCTGTA 


7450 


TTTTGTTTAT 


T6CTCCAAGA 


6AGTCATACC 


ATGTTTGTAC 


AGCCCAGGGA 


7500 


AATTGCCGAG 


TGACCGCCAT 


GCGCA6AACA 


Atgcagaatg 


AGATGGTGGT 


7550 


GAATATTTTC 


CGGAGGATGA 


TTCCTTTGAT 


TAGTGCATAG 


GGAAGCACAG 


7600 


ATAAAAACAC 


CACAAAGAAC 


CCTGAGAAGA 


AGAAGGCTGA 


GCTATTGAAG 


7650 


TATCTCACAT 


AGGCTGCCTT 


CC6AGTCAGT 


TTCAGTTCTG 


TTTGTCTTAA 


7700 


GTTTTCAATC 


ATTTTTTCCA 


TTGCTTCTTC 


CCAGCAGTAT 


GCCTTAACAG 


7750 


ATTGGATGTT 


CTCGATCATT 


TCTGAGGTAA 


TCACAAGTCT 


TTCACTGATC 


7800 
TTCCCAGCTC


TCTGATCTCT 


6TACTTCATC 


ATCAITCTCC 


CTAGCCCAGC 


7850 


CT6AAAAAG6 


GCAAGGACTA 


TCAG6AAACC 


AA6TCCACAG 


AAGGCA6ACG 


7900 


CCT6TAACAA 


CTCCCAGATT 


A6CCCCAT6A 


^M.^^^ .^BiWft^M ^m^A^ 

GGAGTGCCAC 


TT6CAAAGGA 


7950 


GCGATCCAOl 


CGAAATGTGC 


CAATGCAA6T 


CCTTCATCAA 


ATTTGTTCA6 


8000 


GTTGTTGGAA 


AGGA6ACTAA 


CAAGTT6TCC 


AATACTTATT 


TTATCTAGAA 


8050 


CACGGCTTGA 


CAGCTTTAAA 


6TCTTCTTAT 


AAATCAAACT 


« « m 44m mm 44^OTi 

AAACATAGCT 


8100 


ATTCTCATCT 


6CATTCCAAT 


GTGATGAAGG 


CCAAAAATGG 


44*n44#444m^4ffn% 0^ 

CTGGGTGTAG 


8150 


GAGCAGTGTC CTCACAATAA


AGAGAAGGCA 


TAAGCCTATG 


CCTAGATAAA 


8200


TC6CGATAGA 


GCGTTCCTCC 


TT6TTATCCG 


GGTCATAGGA 


hGCrlATGATrS: 



8250 


CTTCCCAGTA 


A6AGAGGCTG 


TACTGCTTTG 


6TGACTTCCC 


CTAAATATAA 


8300 


AAAGATTCCA 


TAGAACATAA 


ATCTCCAGAA 


AAAACATCGC 


CGAAGGGCAT 


8350 


TAATGAGTTT 


AG6ATTTTTC 


TTTGAAGCCA 


GCTCTCTATC 


CCATTCTCTT 


8400 


TCCAATTTTT 


CAGATAGATT 


6TCA6CAGAA 


TCAACAGAAG 


GGATTTGGTA 


8450 


TATGTCTGAC 


AATTCCA6GC 


GCTGTCTGTA 


TCCTTTCCTC 


AAAATTGGTC 


8500 


TG6TCCAGCT 


GAAAAAAAGT 


TTGGAGACAA 


C6CTGGCCTT 


TTCCAGA66C 


8550 


GACCTCTGCA 


TGGTCTCTCG 


GGCGCTGGGG 


TCCCTGCTAG 


GGCCGTCTGG 


8600 


GCTCAAGCTC 


CTAATGCCAA 


AGGAATTCCT 


GCAGCCCGGG 


GGATCCACTA 


8650 


GTTCTA6AGC 


GGCCGCCACC 


GCGGTGGCTG 


ATCCCGCTCC 


4444 44,444444 444444 44 

CGCCCGCCGC 


8700 


GC6CTTCGCT 


TTTTATA6G6 


CCGC0GCC6C 


CGCCGCCTCG 


4444m mm m % m 

CCATAAAAGG 


8750


AAACTTTOGG 


AGC6CGC06C 


TCT6ATTGGC 


TGCC6CC6CA 


CCTCTCCGCC 


8800 


TCGCCCCGCC 


CCGCCCCTC6 


CCCCGCCCCG 


CCCCGCCTGG 


CGCGCGCCCC 


8850 


CCCCCCCCCC 


CCGCCCCCAT 


CGCTGCACAA 


AATAATTAAA 


AAATAAATAA 


8900 


ATACAAAATT 


GGGGGTGGGG 


AGGGGGGGGA 


GATGGGGAGA 


GTGAAGCAGA 


8950 


ACGTGGCCTC 


6AGTA6AT6T 


ACTGCCAAGT 


A6GAAAGTCC 


CATAAGGTCA 


9000 


T6TACTGGGC 


ATAATGCCA6 


GCGGGCCATT 


TACCGTCATT 


GACGTCAATA 


9050 


GGGGGCGTAC 


TTGGCATATG 


ATACACTTGA 


TGTACTGCCA 


AGTGGGCAGT 


9100 



WO 96/13597

PCT/US95/14017

24/45
FIGURE 7H





TACTCCACCC ATTGACGTCA ATGGAAAGTC CCTATTGGCG 



9150 




AACATACGTC 


ATTATTGACG 


TCAATGGGCG 


GGGGTCGTTG 


9200 




CAGGCGGGCC ATTTACCGTA AGTTATGTAA CGACCTGCAG 


9250 


vCXuiiXv»XwU 


CTAGACAAAT ATTACGCGCT ATGAGTAACA CAAAATTATT 


9300 


UAwAX X X wiw 


TTCCTCTTAT 


TCAGTTTTCC 


CGCGAAAAT6 


GCCAAATCTT 


9350 


AWX WblsX XAw 


GCCCAAATTT 


ACTACAACAT 


CCGCCTAAAA 


CCGCGCGAAA 


9400 


AX XviXwAwX X 


CCTGT6TACA 


CCGGCGCACA 


CCAAAAACGT 




9450 


AVtfA X W X 


CTTACATGTG 


TTCCGCCACA 


CTTGCAACAT 


CACACTTCCG 


9500 


W\***V#*1kW X A\» X 


ACGTCACCCG 


CCCCGTTCCC 


ACGCCCCGCG 


CCACGTCACA 



9550 


An W A WSwA w 


CCTCATTATC 


ATATTGGCTT 


CAATCCAAAA 


TAAGGTATAT 


9600 


XAX XiaAXAaAX 


6CTAGCATGC 


6CAAATTTAA 


AGCGCTGATA 


TCGATCGCGC 


9650 


GCAGATCT6T 


CATGATGATC 


ATT6CAATTG 


GATCCATATA 


TAGGGCCCGG 


9700 


GTTATAATTA 


CCTCAGGTCG 


ACGTCCCAT6 


GCCATTOGAA 


TTCGTAATCA 


9750 


TGGTCATAGC 


TGTTTCCTGT 


GTGAAATTGT 


TATCCGCTCA 


CAATTCCACA 


9800 


CAACATAC6A 


GCCGGAA6CA 


TAAA6T6TAA 


AGCCTGGGGT 


GCCTAAT6A6 


9850 


TGAGCTAACT 


CACATTAATT 


6CGTT6CGCT 


CACTGCCCGC 


TTTCCAGTCG 


9900 


GGAAACCTGT 


CGTGCCAGCT 


GCATTAATGA 


ATCGGCCAAC 


GCGCGGGGAG 


9950 


AGGCGGTTTG 


CGTATTGGGC 


GC 






9972 
CCAATTCCAT


CATCAATAAT 


ATACCTTATT 


TTGGATT6AA 


6CCAATATGA 


50 


TAATGA66G6 


GTGGAGTTT6 


TGACGTGGCG 


CGG6GCGTG6 


6AAC6GG6CG 


100 


6GTGAC6TA6 


GTTTTAGGGC 


GGAGTAACTT 


GTAtGTGTTG 


GGAATTGTAG 


150 


TTTTCTTAAA ATGGGAA6TT 


ACGTAACGTG 


GGAAAACGGA 


AGT6AC6ATT 


200 


T6A66AAGTT 


GTGGGTTTTT 


TGGCTTTCGT 


TTCTGGGC6T 


AGGTTCGC6T 


250 


6C66TTTTCT 


6GGTGTTTTT 


T6T66ACTTT 


AACCGTTACG 


TCATTTTTTA 


300 


GTCCTATATA 


TACTCGCTCT 


6CACTTGGCC 


crrrrrrACA 


CTGTGACT6A 


350 


TTG AG CTGGT 


GCCGTGTCGA 


GTGGTGTTTT 


TTTAATAGGT 


TTTCTTTTTT 


400 


ACTCGTAAGG 


CTGACT6TTA 


GGCTGCC6CT 


GTGAAGCGCT 


GTATGTTGTT 


450 


CTGGAGCGGG 


AGGGTGCTAT 


TTT6CCTA66 


CAG6AGGGTT 


TTTCAGGTGT 


500 


TTAfPGTGTTT 


TTCTCTCCTA 


TTAATTTTGT 


TATACCTCCT 


ATGGGG6CTG 


550 


TAATGTTGTC 


TCTACGCCTG 


CGGGTATGTA 


TTCCCCCCAA 


GCTT6CAT6C 


600 




ACTCTA6AGG 


ATCC6AAAAA 


ACCTCCCACA 


CCTCCCCCTG 


650 


AACCT6AAAC 


ATAAAATGAA 


T6CAATT6TT 


6TTGTTAACT 


TGTTTATT6C 


700 


A6CTTATAAT 


GGTTACAAAT 


AAAGCAATA6 


CATCACAAAT 


TTCACAAATA 


750 


AAGCATTTTT 


TTCACT6GAT 


TCTAGTT6T6 


GTTT6TCCAA 


ACTCATCAAT 


800 


GTATCTTATC 


ATGTCTGGAT 


CCCCGCGGCC 


GCTCTA6AAC 


TAGTGGATCC 


850 


CCCGGGCTGC 


AG6AATTCC6 


TAACATAACT 


GCGTGCTTTA 


TTGA6ATACA 


900 


CAGTAAAGCA 


GTAATATAAT 


ACAATA6TAA 


GGCATATATT 


TGGT6AAATC 


950 


TGATATGTT6 


T6AAAAT6CA 


6TAAAACTGA 


AGTTTAAAAA 


AATAATTAGT 


1000 


AAATGTTACA 


GT6TTGGTGT 


TAAAACACAA 


TCTATTATGA 


TACTCAAGTA 


1050 


AGAGTCCAGT 


ACCTG6AGAC 


AATGATGATA 


CATGCCATGT 


GATGATTAT6 


1100


CTTCAGTTAC 


ACTGATTATG 


ATTTACACTT 


TAATACTTGA 


TGGTTATAAA 


1150 


6AACATGAAA 


TGATGTCCAA 


ATTATGCTTA 


AAATCAGCAA 


TAAAGCTCTC 


1200 


AGTTTTTATT 


CAAATATTTT 


GATAGATTCA 


CTCCAGAACT 


AATATCTAAA 


1250 
FIGURE 12B






AGATAAAAC6 


AAAAGATTAA 


AACAAAACTA 


TGCACTCTAT 


CTACCTTG6A 


1300 


TTTTAGAATG 


AAACTTAAAA 


CTTCTTA6TA 


6GAAA66AAC 


CCCTT6TTTT 


1350 


AAATCTT6GT 


GAAAACAAAT 


CCTT6GATAA 


AGAAAATGCC 


CAGTGCCACA 


1400 


TAAAGGA6A6 


A6AGAGAGAA 


AAGCAAGACC 


A6AACCAAAT 


TTCAATTTGT 


1450 


TATCTTAGA6 


CTTT666TTT 


TCTTTTGGAA 


ATTATAAATG 


AAAAAA66AA 


1500 


ACT6GTGTCC 


ACACAACAGA 


CAAGTGGT6A AGTTGTGAAA 


TTAGGTGTGC 


1550 


ACAATTACTA 


GAAACACCCC 


AAAACCAAAG 


TGAGGTAGAA 


ATAGCATGA6 


1600 


AAGCTGT6TT 


TGAT6TTAAT 


TACAATTAAT 


AATGGACAAA 


ACCCACTCGC 


1650 


TAGAA6TTAA 


TTACACTTGA 


C6TTAGAG6T 


AACAGATTTG 


CAAAAT6ATA 


1700 


66ACAGTGAT 


TTCTATT6A6 


A6AATGCTCT 


TTAAATGCTA 


A6AAGAAGAA 


1750 


ACTGGCAT6A 


GA66AGTAAA 


6CTCTTCCTA 


6CAGTCCTTA 


GCTTTCTGTT 


1800 


GCACTTTTTC 


TCCTGGTTCA 


AT6ACTTGCA 


TTTGTTTAGA 


CATTTCAGCC 


1850 


CGTCAACTAG 


ACCAGAGA6T 


TTGGA6ACGC 


TTTTGCTCTC 


AAAACTTTCC 


1900 


AACCACTGTG 


CCTTCTCACC 


CACAATCCT6 


TGTG6A6TTA 


CTTGCA6GGA 


1950 


AACCAATGCA 


AAGGAGACAA 


ATGCAGTTCA 


TGGGCTTCT6 


GACTGATATT 


2000 


CACCAGGGTC 


ACAATGTGAT 


TGGGTTACTT 


TCTTAACAGT 


AATCCTAA6T 


2050 


CTTGCAGCAT 


TAAAAAAAAA 


AATCATCACA 


ATGAAGAAAA 


AAAAACCCAA 


2100 


AAAATCTAAA 


ATCTAAAATT 


CATCATCATC 


ATCAACAACA 


ACAACAACAA 


2150 


CAACAACAAA 


ACCACCCACT 


TCAGGTTGAG 


TTTAT6AAGA 


GGGCA6AACA 


2200 


ATTTAGTTGT 


AATTATA6A6 


ATGTTTATAT 


6TATAGTTGT 


AAATATTCAT 


2250 


CCATTCTTTT 


ACAGAGTTGT 


TGCTCCCCTC 


ATATAAATTG 


ACTGAGGAGC 


2300 


CGCAACCTTT 


A6CTCCTACC 


ATCTTCCTCC 


TACTGTCTGG 


GAGTTAAAAA 


2350 


TGTCATCT6A 


T6TTCTATTG 


CAGAAACATC 


ATTAAATATA 


ACCCAACAGT 


2400 


A6GAAGTT6A 


ATATATCAGC 


CAACAAATTA 


CTATGATAGT 


AAGTCCTGTG 


2450 


TATTCATTC6 


CATGTTCCTT 


6AAAAAAAT6 


AATCCTCTAG 


CTCTCAGTGG 


2500 
FIGURE 12C






AAA6TTTAAA 


ACTAGAAACA 


TCTGGAGCCC 


TAGACAATAT 


TTTAGTGTGG 


2550 


CGGTA6TCTC 


CTGGCTTT6G 


GCTCCA6GGA 


AAATTCACTC 


TTGCCCAAGC 


2600 


AGATAA6CCC 


A6ATGACTA6 


AAGCAATTTC 


CA'^AGGAAG 


T6GCAAGAAC 


2650 


ATTT6AA6AA 


6TAACTTCAT 


ATCTATTTAT 


CTATATACCT 


ATA6TATTTA 


2700 


TATACTT6TA 


GACATATAGA 


T6TATAAAAT 


6AAAGCCCAT 


A6CCA6CCCC 


2750 


ACTCAGTCAA 


CAATTCTCAA 


AAGAGCAATA 


TGAAGCA6TC 


ATTTGGT6GG 


2800 


6TTC6TAT6C 


AAGAAAATAA AAAAAC6TCA 


TGAATTCCAT 


ATGAATACCA 


2850 


C6CTAAA6TA 


AT6CAAAACA ATGTGCTGCC 


TCAGTGTGTG 


TGT6TGT6T6 


2900 


T6T6T6T6T6 


GT6G6TTC6T 


GCAT6TAT6T 


GTGCGTGT6T 


GTGTGTGTGT 


2950 


GTGTGTGTGT 


GT6TGTGTGC 


GTGTGTGTTT 


GTTTAGGGGT 


TTTTATAAAC 


3000 


AACrriTTTT 


ATAAA6CACA 


CTTTAGTTTA 


CAATCTCTCT 


TTATAACTGT 


3050 


TATAAATTTT 


TAAACAACCC 


AAAAT6C6TT 


CCATATAAA6 


AAATG6CAAG 


3100 


TTATTTAGCT 


ATCAA6ATTT 


TACAT6TTTT 


CTTTTAACTT 


TTTTGTACAA 


3150 


TTGCATAGAC 


GTGTAAAACC 


TGCCATTGTT 


AACAAAACAA 


TAACAGACTT 


3200 


AGAAACTACT 


GAAATCTACA 


GTATAGTACC 


ACTACCCTTC 


ACAAAAATAT 


3250 


A6ATTTTATT 


TCTT6TAAAC 


TCTTACT6TC 


TAATCCTCTT 


TGTTGTAC6A 


3300 


ATATTATAAA 


AACCATGC6G 


6AATCAG6A6 


TTGTAAAACA 


TTTATTCT6C 


3350 


TCCTTCTTCA 


TCT6TCAT6A 


CT6AAACTAA 


G6ACTCCATC 


GCTCTGCCCA 


3400 


AATCATCTGC 


CATGTGGAAA 


AGGCTTCCTA 


CATTGTGTCC 


TCTCTCATTG 


3450 


GCTTTCCGG6 


GGCATTTCTT 


CCTCTTGAAC 


TAGGGAAGGA 


GTTGTTGAGT 


3500 


T6CTCCATCA 


CTTCTTCTAA 


CCCTGTGCTT 


GT6TCCT666 


GA6GACTCA6 


3550 


AAGATCTTCC 


TCACCCATA6 


ATTCT6AA6T 


TTGACTGCCA 


ACCACTCGGA 


3600 


GCAGCATAGG 


CTGACT6CTA 


TCTGACCTCT 


GCA6AGAGGT 


GGAAGGA6A6 


3650 


GACACCGTGG 


TGCCATTCAC 


CTTAGCTTCA 


GCCTGGGGCT 


GCTCCAGGAG 


3700 


CTGTCTCA6T 


CTATGTAACT 


6A6ACTCCAG 


CTGTTTATTG 


TGGTCTTCCA 


3750 
FIGURE 12D






G6ATTTGCAT CCTGGCTTCC AGGC6TCCTT T6TGTTG6C6 CA6TA6CTTA 


3800 


GCCTCAGCAA T6A6CTCAGC ATCCCTGGGA CTCTGAG6AG AGGTGGGCAT 


3850 


CATCTCAGGA 


GGAGATGGCA 


GTGGAGACAG GCCTTTATGC 


TCATGCTGCT 


3900 


GCTTCAGGCG 


ATCATATTCT 


GCTTGCAGAT TCCTGTTTTC 


TTCCTCAAGA 


3950 


TCTGCTAGGA 


TTCTCTCTA6 


CTCCCCTCTT TCCTCACTCT 


CTAAGGAAAT 


4000 


CAA6ATCT66 


GCAGGACTAC 


GAGGCTGGCT CAGG6GG6A6 


TCCTGGTTCA 


4050 


AACTTTGGCA 


GTAAT6CTGG 


ATTAACAAAT GTTCATCATC 


TATGCTCTCA 


4100 


TTA6GAGAGA 


TGCTATCATT 


TAGATAAGAT CCATTGCTGT 


TTTCCATTTC 


4150 


TGCTAGCCTG 


CTAGCATAAT 


GTTCAATGCG TGAATGAGTA 


TCATCGTGT6 


4200 


AAAGCTGGGG 


66AC6A66CA 


G6C6GAGAAT CTACT66CCA 


GAAGTTGATC 


4250 


AGAGTAACGG 


6A6TTTCCAT 


GTT6TCCCCC TCTAACACAG TCT6CACTGG 


4300 


CA6GTAGCCC ATTCGG66AT 


GCTTCGCAAA ATACCTTTT6 


6TTC6AAATT 


4350 


TGTTTTTTAG 


TACCTTG6CG 


AAGTCGCGAA CATCTTCTCC 


GGATGTAGTC 


4400 


GGAGTGCAAT 


ACTCTACCAT 


GGGGTAGT6C ATTTTAT66C 


CCTTTGCAAC 


4450 


TOGGCCAGAA 


AAAAAGCAAC 


TTTGGCAGAT GTCATAATTA 


AAATGCTTTA 


4500 


GGCTTCT6TA 


CCTGAATCCA 


ATGATTGGAC ACTCCTTACA 


6ATGTTACAC 


4550 


TTG6CTT6AT 


GCTTGGCA6T 


TTCAGCAGCA GCCACTCTGT 


GCAAGACGGG 


4600 


CAGCCAGACC 


ATAGACTGGG 


GTTCCAGGCG CATCCAGTCA 


AGGAAGAGAG 


4650 


CAGCTTCAAT 


CTCAGGTTTA 


TTATTGGCAA ATTGGAA6CA 


GCTCCTGACA 


4700 


CTCGGCTCAA 


TGTTACTGCC 


CCCAAAGGAA GCAACTTCAC 


CCAACTGTCT 


4750 


TGGGATTTGA 


ATAGAATCAT 


GCAGAAGAAG AiCCCAGCCTA 


CGCTGGTCAC 


4800 


AAAAGCCAGT 


TGAACTTGCC 


ACTTGCTTGA AAAGGTATCT 


GTACTTGTCT 


4850 


TCCAAGT6TG 


CTTTACACAG 


AGAAATGATG CCAGTTTTAA 


AAGACAGGAC 


4900 


ACGGATCCTC 


CCTGTTC6TC 


CC6TATCATA AACATTGA6A AGCCAGTTGA 


4950 


GACACATATC 


CACACA6AGA 


G6GAGATT6A CCAGATT6TT 


GTGCTCTTGC 


5000 


TCCAGACGAT 


CATAAATTGT 


AGTCAAACA6 TTAATTATCT 


GCAGGATATC 


5050 
FIGURE 12E

CATGGGCTGG TCATTTTGCT TGAGGTTGTG CTGGTCCAGG GCATCACATG 5100

CAGCTGACAG GCTCAAGAGA TCCAAGCAAA GGGCCTTCTG GAGCCTTCTG 5150

AGCTTCATGG CAGTCCTATA CGCGGAGAAC CTGACATTAT TCAGGTCAGC 5200

TAAAGACTGG TAGAGCTCTG TCATTTTGGG GTGGTCCCAA CAAGTGGTTT 5250

GGGTCTCGTG GTTGATATAG TAGGGCACTT TGTTTGGTGA GATGGCTCTC 5300

TCCCAGGGAC CCTGAACTGA AGTGGAAAGG AAGTGCTGGG ATGCAGGACC 5350

AAAGTCCCTG TGGGCTTCAT GCAGCTGTCT GACACGGTCC TCCACAGCCA 5400

CCTGTAGAAG CCTCCATCTG GTATTCAGAT CTTCCAAAGT GCTGAGGTTA 5450

TAAGGTGAGA GCTGAATGCC CAGTGTGGTC AGCTGATGTG CAAGGTCATT 5500

GACACGATTG ACATTCTCTT TAAGAGGTGC AATTTCTCCC CGAAGTGCCT 5550

TGACTTTTTC AAGGTGATCT TGCAGAGAGT CAATGAGGAG ATCCCCCACT 5600

GGCTGCCAGG ATCCCTTGAT CACCTCAGCT TGGCGCAACT TGAGGTCCAG 5650

TTCATCGGCA GCTTCCTGAA GTTCCTGGAG TCTTTCAAGA GCTTCATCTA 5700

TTTTTCTCTG CCAATCAGCT GAGC^CAGGT TCAATTTGTC CCATTCAGCG 5750

TTGACCTCTT CAGCCTGCTT TCGTAGGAGC CGAGTGACAT TCTGAGCTCT 5800

TTCTTCAGGA GGCAGTTCTC TGGGCTCCTG GTAGAGTTTC TCTAGTCCTT 5850

CCAAAGGCTG CTCTGTCAGA AATATTCTCA CAGTCTCCAG AGTACTCATG 5900

ATTACAGGTT CTTTAGTTTT CAATTCCCTC TTGAAGGCCC TATGTATATC 5950

ATTCTGCTTC TGAACTGCTG GGAAATCACC ACCGATGGGT GCCTGACGGC 6000

TCAGTTCATC ATCTTTCMSC TGTAGCCAAA CAAGAAGTTC CTGAAGAGAA 6050

AGATGCAAAC GCTTCCACTG GTCAGAACTT GCTTCCAAAT GGGACCTAAT 6100

GTTGAGAGAC TTTTTCTGAA GTTCACTCCA CTTGAAATTC ATGTTATCCA 6150

AACGTCTTTG TAACAGGGGT GCTTCATCCG AACCTTCCAG GGATCTCAGG 6200

ATTTTTTGGC CATTTTCATC AAGATTGTGA TAGATATCTG TGTGAGTTTC 6250

AATTTCTCCT TGGAGATCTT GCCATGGTTT CATCAGCTCT CTGACTCCCC 6300

TGGAGTCTTC TAGGAGCTTC TCCTTACGGG AAGCGTCCTG TAGGACATTG 6350
FIGURE 12F

GCAGTTGTTT CTGCTTCCGT AATCCAGGAA AGAAACTTCT GCCTCTTGCT 6400

AGGGAACTGC TGCAGTAATC TATGAGTTTC TTCCAAAGCA ATTCTCTGTT 6450

CACTTACTCT TTTATGAATG TTTCCCCAAG AAGTATTGAT GAAGAAGTTT 6500

ATCATGTGTA CTTTTCTGGT ATCATCAGCA GAATAGTCCC [illegible] 6550

CAGTGCCAAA TCATTTGCCA CGTCTACACT TATCTGCCGT TGACGGAGGT 6600

CTTTGGCCAA CTGCTTGGTT TCTGTGATCT TCTTTTGGAT TGCATCTACT 6650

GTGTGAGGAC CTTCTTTCCA TGAGTCAAGC TTGCCTCTGA CCTGTCCTAT 6700

GACCTGTTCG GCTTCTTCCT TAGCTTCCAG CCATTGTGTT GAATCCTTTA 6750

ACATTTCATT CAACTGTTGT CTCCTGTTCT GCAGCTGTTC TTGAACCTCA 6800

TCCCACTGAA TCTGAATTCT TTCAATTCGA TCAGTAATGA TTGTTCTAGC 6850

TTCTTGATTG CTGGTTTTGT TTTTCAAATT CTGGGCAGCA GTAATGAGTT 6900

CTTCCAATTG GGGGCGTCTC TGTTCCAAAT CTTGCAGTGT TGCCTTCTGT 6950

TTGATGATCA TTTCATTGAT GTCTTCCAGA TCACCCACCA TCACTCTCTG 7000

TGATTTTATA ACTCGATCAA GCAGAGACAG CCAGTCTGTA AGTTCTGTCC 7050

AAGCTCGGTT GAAGTCTGCC AGTGCAGGTA CCTCCAACAG CAAAGAAGAT 7100

GGCATTTCTA GTTTGGAGAT GACAGTTTCC TTAGTAACCA CAGATTGTGT 7150

CACTAGAGTA ACAGTCTGAC TGGCAGAGGC TCCAGTAGTG CTCAGTCCAG 7200

GGGCACGGTC AGGCTGCTTT GTCCTCAGCT CCCGAAGTAA ATGGTTTACA 7250

GCCTCCCACT CAGACCTCAG ATCTTCTAAC TTCCTCTTCA CTGGCTGAGT 7300

GCTTGGTTTT TCCTTATACA AATGCTGCCC TTTCGACAAA AGCCTTTCCA 7350

CATCCGCTTG TTTACCGTGA ACTGTTACTT CAATCTCCTT TATGTCAAAC 7400

GGTCCTGCCT GACTTGGTTG GTTATAAATT TCCAACTGGT TTCTAATAGG 7450

AGAGACCCAC AGAAGCAGGT GATCCAGCTG CTCTTCAAGC TGCCTAAAAT 7500

CTTTTAAGTG AACCTCAAGC TCTCCTTGTT TCTCAGGTAA AGCTCTGGAG 7550

ACCTTTATCC ACTGGAGATT TGTCTGTTTG AGCTTCTTTT CAAGTTTATC 7600
TTGCTCTTCT


6GCCTTATGG 


GA6CACTTAC 


AAGTACTGCT 


CCTCCT6TTT 


7650 


CATTTAATT6 


TTTTAGAATT 


CCCTGGC6CA 


GGGGCAACTC 


TTCT6CCA6T 


7700 


AACTTGACTT 


GTTCAA6TT6 


TTCTTTTA6C 


T6CTGCTCAT 


CTCCAA6TGG 


7750 


A6TAATAGCA 


AT6TTATCT6 


CTTCTTCCA6 


CCACAAAACA 


AATTCATTTA 


7800 


AATCTCTTT6 


AAATTCTGAC 


AA6ACATTCT 


TTTGTTCTTC 


AATCCTCTTT 


7850 


CTCCTTTCTG 


CCAGCTCTTT 


6CAGATGTC6 


T6CCACC6CA 


GACTCAAGCT 


7900 


TCCTAATTTT 


TCTTGTAGAA 


TATTGACATC 


T6TTTTT6AA 


6ACT6TTGAA 


7950 


TTATTTCTTC 


CCCAGTTGCA 


TTCAGTGTTC 


TGACAACAGC 


TTGACGCTGC 


8000 


CCAATGCCAT 


CCTGGAGTTC 


CTTAAGATAC 


CATTTGTATT 


TAGCATGTTC 


8050 


CCA6TTTTCA 


GGATTTT6TG 


TCTTTTT6AA 


AAACTGTTCA 


ACTTCATTCA 


8100 


6CCATTGATT 


AAATACCTTC 


ATATCATAAT 


GAAA6TGTC6 


CCATTTTTCA 


8150 


ACT6ATCTGT 


CGAATCGCCC 


TTCTC6TTCC 


TT6TACATTC 


TAT6AA6TTT 


8200 


TTCCCCCTGG 


AAATCCATCT 


GTGCCACGGC 


TTCCTGTACT 


TTCACCTTTT 


8250 


CCATGGAGGT 


GGCACTTTGC 


AA66CTGCT6 


TCTTCTTCTT 


GTGAATAATA 


8300 


TCAATCC6AC 


CTGAGATTTG 


TT6CAAATT6 


TCTTTTATAT 


TCTTAAGAGA 


8350 


CTCCTCTTGC 


TTAAAAA6AT 


CTTCAAAATC 


TTTAGCACAG 


AGTTCAGGA6 


8400 


TATTTAGAAG 


ATGATCAACT 


TCTTGAAAGAG 


CTT6TAAGAT 


ATGACTGATC 


8450 


TCGGTCAAAT 


AAGTAGAAGG 


CACATAA6AA 


ACATCCAAAG 


GCATATCTTC 


8500 


AGTCGTCACT 


ACCATAGTTT CTTCATGGAG 


A6T6TGAATT 


T6T6CAAA6T 


8550 


TGAGTCTTCG 


AAACTGAGGA 


AAATTGCTCT 


CAATTTGCC6 


CCAGC6CTTG 


8600 


CT6A6CTGGA 


TCTGAGTTGG 


CTCCACT6CC 


ATTGC66CCC 


CATTCTCAGA 


8650 


CAAGCCCTCA 


GCTTGCCT6C 


GCACTGCATT 


CAGCTCCTCT 


TTCTTCTTCT 


8700 


GCAATTCACG 


ATCAATTTCC 


TTTAATTTTC 


TTTCATCTCT 


G6GTTCAGGT 


8750 


AGGCTGGCTA 


ATTTTTTTTC


AATTTCATCC 


AAGCATTTCA 


GGAGATCATC


8800 


AGCCTGCCTC


TTGTACTGAT ACCACTGGTG


AGAAATTTCT


AGGGCCTTTT 


8850
TTCTTCTTTG


AGACCTCAAA 


TCCTTGAGAG


CATTATGTTT 


TGTCTGTAAC


8900 


AGCTGCTGTT TTATCTTTAT


TTCCTCTCGC 


TTTCTCTCAT 


CTGTGATTCT


8950 


TTGTTGTAAG


TTGTCTCCTC 


TTTGCAACAA 


TTCATTTACA 


GTACCCTCAT 


9000 


TGTCTTCACT 


CATATCTTTA 


TTGAAGTCTT 


CCTCTTTCAG 


ATTCACCCCC 


9050 


TGCTGAATTT 


CAGCCTCCAG 


TGGTTCAAGC 


AATTTTTGTA 


TATCTGAGTT 


9100 


AAACTGCTCC AATTCCTTCA 


AAGGAATGGA


GGCCTTTCCA 


GTCTTAATTC 


9150 


TGTGAGAAAT 


AGCTGCAAAT 


CGACGGTTGA


GCTCAGAGAT 


TTGGGGCTCT 


9200 


ACTACTTTCC 


TGCAGTGGTC 


ACCGCGGTTT 


GCCATCAATT 


TTGCTGCTTG 


9250


GTCACGTGTG 


GAGTCCACCT 


TTGGGCGCAT


GTCATTCATT 


TCAGCCTTTA 


9300 


AACGCTTAAG 


AATGTCTTCC 


TTTTGTTGTG 


GTTTCTTCTT


TTCAGACTCA 


9350 


TCTAAAAGTT 


CATCTGCATG 


AATGATCCAC 


TTTGTGATTT 


GTTCTATGTT 


9400 


CTGATCAAAG 


GTTTCCATGT 


GTTTCTGGTA


TTCCAACAAA 


AGATTTAGCC 


9450 


ATTCTTCTAC 


TCTGGAGGTG 


ACAGCTATCC 


AGTTACTGTT 


CAGAAGACTC 


9500 


AGTTTATCTT 


CTACCAAGGT 


TTCTTTCTTG 


CCCAACACCA 


TTTTCAAAGA 


9550 


CTCTCCTAAT 


TCTGTAACAC 


TCTTCAAGTG 


AGCCTTCTGT 


TTCTCAATCT 


9600 


CTTTTTGAGT 


AGCCTTTCCC 


CAGGCAACTT 


CAGAATCCAA 


ATTACTTGGC 


9650 


ATTCCTTCAA 


CTGCTGATCT 


CTTCGTCAAT 


TCTGTATCTG 


TTGCTGCCAG 


9700 


CCATTCTGTT 


AAGACATTCA 


TTTCCTTTCT 


CATCTTACGG 


GACAACTTCA 


9750 


AGCATTTCTC 


CAACTGTTGC


TTTCTCTCTG 


TTACCTTCGC 


ACCCAACTCA 


9800 


TTGTAATGCA 


ATTTCAAAGC 


TGTTACTCGT 


TCATCAAGCT 


CTTTGGGATT 


9850 


TTCTGTCTGC 


TTTTTCTGTA 


CAATTTGACG 


TCCGGTTTTA 


ATCACCATTT 


9900 


CCACTTCAGA 


CTTGACTTCA 


CTCAGGCTTT 


TATACAAGTT 


CACACAATGA 


9950 


CTTAGTTGTG


ACTGAATTAC 


TTCCTGTTCA 


ACACTCTTGG 


TTTCCAATGC 


10000 


AGGCAAATGC 


ATCTTGACTT 


CATCTAAAAT 


CATCTTACTT


TCCTCTAGAC 


10050 


GTTGTTCAAA 


ATTGGCTGGT 


TTTTGGAATA 


ATCGAAATTT 


CATGGAGACA 


10100 


TCTTGTAATT 


TTTTCTGTGC 


AACATCAATT 


TGTGAAAGAA


CCCTTTGGTT 


10150
FIGURE 12I






GGCATCCTTC


CCCTGGTTAT 


GTTTCTTCAT 


TTCTTCTAAA 


CTTATCTCAT 


10200 


GACTTGTCAA


ATCTGATTGG 


ATTTTCTGGG 


CTTCCTGAGG


CATTTGAGCT 


10250 


GCATCCACCT


TGTCAGTGAT


ATAAGCTGCC 


AACTGCTTGT 


CAATGAATTC 


10300 


AAGCGACTCC


TGAATTAAGT 


GCAAGGACTT 


TTCAATTTCC 


TGGGCAGACT 


10350 


GGATACTCTG


TTCAAGCAAC 


TTTTGTTTCC 


TCACAGCCTC 


TTCATGTAGT


10400 


TCCCTCCAAC 


GAGAATTAAA 


CGTCTCAAGC 


TCCTCATTGA 


TCAGTTCATC 


10450 


CATGACTCCT 


CCATCTGTAA 


GAGTCTGTGC 


CAATAGACGA 


ATCTGATTTG


10500 


GGTTCTCCTC 


TGAATGATGC 


ATCAGATTTT 


CAAGAGATTC 


TAGCACTTCA 


10550 


GTGATTTCCT


CAGGTCCTGC 


AGGAACATTT 


TCCATGGTTT 


TAAGTTTCAA 


10600 


TTCTACTTCA TTGAGCCACT 


TGTTTGCTTT 


CTCTAAATAT
TGGTCACCAT


12400 


AGTTACCGTT


TCCATTACAG 


TTGTCTGTGT 


TAGGGATGGT 


TGAGTGGTGG 


12450 


TGACAGCCTG 


TGAAATTTGT 


GCTGAACTCT 


TTTCAAGTTT
GATCTATTTT


TAAAGTAGAT
TTACTTCCGT


13450 


CTCCATCAAT 


GAACTGTCAA 


GTGACTTGTC 


TCTGGGAGCT 


TCCAAATGCT
TTACTTTTGA


AGATGTCCTG
GGCAACATTT


CCACTTCTTG 


AATGGCTTCA 


ATGCTCACTT 


GTTGTGGCAA
AACTTGAAAG


AGTGATGTGA 


TGTACATTAA


GATGGACTTC 


TTGTCTGGAT 


13800 


AAGTGGTAGC 


AACATCTTCA 


GGATCAAGAA


GTTTTTCTAT 


GCCTAACTGG
CATTTTGCAA


TGTTGAAGGC


ATGTTCCAGT


CTTTGGGTGG 


CTGAGTGCTG 


13900
TGAAACCACA CTATTCCAAT
GAGCATTCAA AGCCAACCCG
TTAACCTGTG GATAATTACG
CTTTTCACTG TTGGTTTGCT
TTTTGACCTG CCAGTGGAGG 
TTATGATTTC CATCCACTAT 
ATTATTTTTC TGTAAGACCC 
GAACTCTTGT AGATCCCTTT 
TCCAAGAGGT CTAGGAGGCG
GTCTATGTGT TGCTTTCCAA
TGAATGTTTT CTTTTGAACA 
TCCCACGAAA GCATTTGGAA 
ATCTTGGTAA AAGTTTCTCC 
GATGAGAAGC CAATAAACTT
TAGCACTTCA AGTCTTCCTA 
AGAGCGGAAT TCCTGCAGCC 
GGGTACAATT CCGCAGCTTT 
TAGAAGTAAA GGCAACATCC 
CACCGGATCC GGGACCTGAA 
TAACTTTCTG GTTTTTCAGT 
AGGCTGGATC GGTCCCGGTG 
GCGTCTCCAG GCGATCTGAC 
CCTCCCACCG TACACGCCTA
TTACGACATT TTGGAAAGTC 
ATTGACGTCA ATGGGGTGGA 
ATCCACGCCC ATTGATGTAC
FIGURE 12L
CAAACAGGTC GGGCCTGTGA 
TCGGACCAGC TAGAGGTGAA 
TGTTGACTGT CGAACCCAGC 
GCAATCCAGC CTTGATAGTT
ATTATATTCC AAATCAAACC 
GTCAGTGCTT CCTATATTCA 
GCAGTGCCTT GTTGACATTG 
TCTTTTGGCA GTTTTTGCCC 
TTTTCCATCC TGCAGGTCAC
ACTTAGAAAA TTGTGCATTT 
TCTTCTCTTT CATAACAGTC
GAAAAAGTAT ATATCAAGGC 
CAGTTTTATT GCTCCAGGAG
CAGCAGCCTT GACAAAAAAA 
TTCGTTTTTT CTATAAAGCT 
CGGGGGATCC ACTAGTTCTA 
TAGAGCAGAA GTAACACTTC 
ACTGAGGAGC AGTTCTTTGA
ATAAAAGACA AAAAGACTAA 
TCCTCGAGTA CCGGATCCTC 
TCTTCTATGG AGGTCAAAAC
GGTTCACTAA ACGAGCTCTG 
CCGCCCATTT GCGTCAATGG 
CCGTTGATTT TGGTGCCAAA
GACTTGGAAA TCCCCGTGAG 
TGCCAAAACC GCATCACCAT
CTATGGATAA


13950 


GTTGATGACG
TCAGAAGAAT
TTCATCACAT


14100 


AAGAGTGAGT
CTAAATCAAC
TTCAGGGCAT
TGTAAGGCCT
TGAAGAGGTT
CTCTACTTCT
AGGGATAAAA
ATTGCCTTCA
GAGCGGCCGC
CGTACAGGCC
TTTGCACCAC
ACTTACCAGT
TAGAGTCCGG


14900 


AGCGTGGATG
CTTATATAGA
GGCGGAGTTG
ACAAACTCCC
TCAAACCGCT


15150 


GGTAATAGCG 


15200
ATGACTAATA CGTAGATGTA
GTACTGGGCA TAATGCCAGG
GGGCCGTACT TGGCATATGA
TACCGTAAAT ACTCCACCCA
TACTATGGGA ACATACGTCA
GCGGTCAGCC AGGCGGGCCA
TCGACTCTAG AGGATCTCCC
AAAATTATTC AGATTTCACT
CCAAATCTTA CTCGGTTACG
CGCGCGAAAA TTGTCACTTC
ACTTTTGCCA CATCCGTCGC 
ACACTTCCGC CACACTACTA 
CACGTCACAA ACTCCACCCC 
AAGGTATATT ATTGATGATG 
CAATGATCAT CATGACAGAT 
TTGCGCATGC TAGCTATAGT 
GCTACGTATA CTCCGGAATA 
GCCGCCTGCA GCTGGCGCCA 
GTACAGAGCT CGAGAAGTAC 
TTGGCACTGG CCGTCGTTTT
TACCCAACTT AATCGCCTTG 
ATAGCGAAGA GGCCCGCACC 
AATGGCGAAT GGCGCCTGAT 
TATTTCACAC CGCATACGTC
CGCATTAAGC GCGGCGGGTG
FIGURE 12M

CTGCCAAGTA GGAAAGTCCC
CGGGCCATTT ACCGTCATTG
TACACTTGAT GTACTGCCAA
TTGACGTCAA TGGAAAGTCC
TTATTGACGT CAATGGGCGG
TTTACCGTAA GTTATGTAAC
TAGACAAATA TTACGCGCTA 
TCCTCTTATT CAGTTTTCCC 
CCCAAATTTA CTACAACATC 
CTGTGTACAC GGGCGCACAC
TTACATGTGT TCCGCCACAC
CGTCACCCGC CCCGTTCCCA 
CTCATTATCA TATTGGCTTC 
CTAGCGGGGC CCTATATATG 
CTGCGCGCGA TCGATATCAG
TCTAGAGGTA CCGGTTGTTA 
TTAATAGGCC TAGGATGCAT 
TCGATACGCG TACGTCGCGA 
TAGTGGCCAC GTGGGCCGTG
ACAACGTCGT GACTGGGAAA 
CAGCACATCC CCCTTTCGCC 
GATCGCCCTT CCCAACAGTT 
GCGGTATTTT CTCCTTACGC
AAAGCAACCA TAGTACGCGC 
TGGTGGTTAC GCGCAGCGTG 








15250 




15300


vXvwvWAvx X 


15350


CXiiX Xuuv«uX 


15400 


X WW X XA9\9 


15450 




15500 


X^V A w X AAWA W 


15550 




15600 




15650 


WAAAAAWVX W 


15700 


X XwWiiAWJlX W 


15750 


vGwwwwvwvW 




AATCCAAAAT 




GATCCAATTG 


15900


CGCTTTAAAT 


15950 


ACGTTAGCCG 


16000


ATGGwGGwwu 


16050


W W\9 W viv A WA X 


16100 


CACCTTAAGC 


16150 


ACCCTGGCGT 


16200 


AGCTGGCGTA


16250 


GCGCAGCCTG


16300 


ATCTGTGCGG 


16350 


CCTGTAGCGG


16400 


ACCGCTACAC


16450 










FIGURE 12N






TTGCCAGCGC


CCTAGCGCCC


GCTCCTTTCG 


CTTTCTTCCC 


TTCCTTTCTC 


16500 


GCCACGTTCG 


CCGGCTTTCC 


CCGTCAAGCT


CTAAATCGGG 


GGCTCCCTTT 


16550 


AGGGTTCCGA


TTTAGTGCTT 


TACGGCACCT 


CGACCCCAAA


AAACTTGATT


16600 


TGGGTGATGG


TTCACGTAGT 


GGGCCATCGC 


CCTGATAGAC 


GGTTTTTCGC 


16650 


CCTTTGACGT


TGGAGTCCAC 


GTTCTTTAAT 


AGTGGACTCT


TGTTCCAAAC 


16700 


TGGAACAACA


CTCAACCCTA 


TCTCGGGCTA


TTCTTTTGAT 


TTATAAGGGA 


16750 


TTTTGCCGAT 


TTCGGCCTAT 


TGGTTAAAAA 


ATGAGCTGAT 


TTAACAAAAA 


16800 


TTTAACGCGA 


ATTTTAACAA 


AATATTAACG 


TTTACAATTT 


TATGGTGCAC 


16850 


TCTCAGTACA 


ATCTGCTCTG 


ATGCCGCATA 


GTTAAGCCAG 


CCCCGACACC 


16900 


CGCCAACACC


CGCTGACGCG


CCCTGACGGG


CTTGTCTGCT


CCCGGCATCC 


16950 


GCTTACAGAC


AAGCTGTGAC 


CGTCTCCGGG


AGCTGCATGT 


GTCAGAGGTT
TTCACCGTCA


TCACCGAAAC 


GCGCGAGACG 


AAAGGGCCTC 


GTGATACGCC
TATTTTTATA


GGTTAATGTC 


ATGATAATAA 


TGGTTTCTTA 


GACGTCAGGT 


17100 


GGCACTTTTC


GGGGAAATGT 


GCGCGGAACC


CCTATTTGTT 


TATTTTTCTA 


17150 


AATACATTCA 


AATATGTATC 


CGCTCATGAG


ACAATAACCC 


TGATAAATGC
TTCAATAATA


TTGAAAAAGG


AAGAGTATGA


GTATTCAACA 


TTTCCGTGTC
GCCCTTATTC
GGCATTTTGC


CTTCCTGTTT 


TTGCTCACCC 


17300 


AGAAACGCTG 


GTGAAAGTAA 


AAGATGCTGA 


AGATCAGTTG


GGTGCACGAG
TGGGTTACAT


CGAACTGGAT 


CTCAACAGCG 


GTAAGATCCT 


TGAGAGTTTT
AACGTTTTCC


AATGATGAGC 


ACTTTTAAAG 


TTCTGCTATG
TGGCGCGGTA


TTATCCCGTA 


TTGACGCCGG
CTCGGTCGCC
GCATACACTA
GACTTGGTTG


AGTACTCACC 


AGTCACAGAA
AAGCATCTTA


CGGATGGCAT
GAATTATGCA


GTGCTGCCAT
AACCATGAGT


GATAACACTG 


CGGCCAACTT 


ACTTCTGACA 


ACGATCGGAG 


17650 


GACCGAAGGA


GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 


17700